Skip to main content
Entropy logoLink to Entropy
. 2020 Jun 9;22(6):642. doi: 10.3390/e22060642

Lie Group Statistics and Lie Group Machine Learning Based on Souriau Lie Groups Thermodynamics & Koszul-Souriau-Fisher Metric: New Entropy Definition as Generalized Casimir Invariant Function in Coadjoint Representation

Frédéric Barbaresco 1
PMCID: PMC7517177  PMID: 33286414

Abstract

In 1969, Jean-Marie Souriau introduced a “Lie Groups Thermodynamics” in Statistical Mechanics in the framework of Geometric Mechanics. This Souriau’s model considers the statistical mechanics of dynamic systems in their “space of evolution” associated to a homogeneous symplectic manifold by a Lagrange 2-form, and defines in case of non null cohomology (non equivariance of the coadjoint action on the moment map with appearance of an additional cocyle) a Gibbs density (of maximum entropy) that is covariant under the action of dynamic groups of physics (e.g., Galileo’s group in classical physics). Souriau Lie Group Thermodynamics was also addressed 30 years after Souriau by R.F. Streater in the framework of Quantum Physics by Information Geometry for some Lie algebras, but only in the case of null cohomology. Souriau method could then be applied on Lie groups to define a covariant maximum entropy density by Kirillov representation theory. We will illustrate this method for homogeneous Siegel domains and more especially for Poincaré unit disk by considering SU(1,1) group coadjoint orbit and by using its Souriau’s moment map. For this case, the coadjoint action on moment map is equivariant. For non-null cohomology, we give the case of Lie group SE(2). Finally, we will propose a new geometric definition of Entropy that could be built as a generalized Casimir invariant function in coadjoint representation, and Massieu characteristic function, dual of Entropy by Legendre transform, as a generalized Casimir invariant function in adjoint representation, where Souriau cocycle is a measure of the lack of equivariance of the moment mapping.

Keywords: Lie groups thermodynamics, Lie group machine learning, Kirillov representation theory, coadjoint orbits, moment map, covariant Gibbs density, maximum entropy density, Souriau-Fisher metric, generalized Casimir invariant function


  La thèse de Kirillov, parue en 1962, a suscité immédiatement beaucoup d’intérêt…En outre, quantité de notions naturelles concernant les représentations s’interprètent géométriquement en terme d’orbites coadjointes: restriction à un sous-groupe, induction unitaire, produit tensoriel, mesure de Plancherel, la topologie de l’ensemble représentations unitaires irréductibles… Kirillov s’est vite convaincu, et il a convaincu la communauté mathématique que cette « méthode des orbites » devait être applicable à des groupes bien plus généraux que les groupes nilpotents. Il n’a pas hésité à aborder le cas des groupes de Lie connexes quelconques. Evidemment, des difficultés considérables ont surgi immédiatement. Néanmoins, Kirillov a indiqué une voie d’accès, qui ensuite a été largement utilisée.

- Jacques Dixmier, Brèves remarques sur l’œuvre de A.A. Kirillov

  On comprend ainsi comment Lagrange a pu développer les lois de la Mécanique des systèmes formés de solides sans s’occuper des variations de la température de ces corps et Fourier traiter des variations de la température de ces mêmes corps solides sans s’occuper de leur mouvement; comment on peut étudier le mouvement de la Terre, assimilée à un solide rigide, sans se préoccuper de la température de cet astre et étudier le refroidissement du globe terrestre sans se préoccuper de son mouvement. Une telle indépendance entre les problèmes qui ressortissent à la Mécanique et les problèmes qui ressortissent à la Théorie de la chaleur n’existe plus lorsque les systèmes auxquels on a affaire ne sont plus des systèmes classiques; si, par exemple, au lieu de regarder la Terre comme un solide rigide, d’état invariable, on tient compte des changements de volume, de forme, d’état physique et chimique qui accompagnent son refroidissement, on ne peut plus séparer le problème du mouvement de la Terre et le problème du refroidissement terrestre. … On sait que cette forme de relations supplémentaires avait été introduite par Newton et les géomètres du XVIIIème siècle dans la théorie du son. Ces considérations montrent que les questions qui ressortissent à la Thermodynamique ont dû solliciter l’attention des physiciens dès qu’on a voulu aborder l’étude des systèmes autres que des systèmes classiques; et, en fait, c’est la théorie de la propagation du son dans l’air qui a provoqué Laplace à créer la Thermodynamique.

- P. Duhem, L’intégrale des forces vives en thermodynamique, JMPA 4:5-19, 1898[1,2,3,4]

  Sous cette aspiration, la physique qui était d’abord une science des “agents” doit devenir une science des “milieux”. C’est en s’adressant à des milieux nouveaux que l’on peut espérer pousser la diversification et l’analyse des phénomènes jusqu’à en provoquer la géométrisation fine et complexe, vraiment intrinsèque…Sans doute, la réalité ne nous a pas encore livré tous ses modèles, mais nous savons déjà qu’elle ne peut en posséder un plus grand nombre que celui qui lui est assigné par la théorie mathématique des groupes

- Gaston Bachelard, Etude sur l’Evolution d’un problème de Physique –La propagation thermique dans les solides, 1928

1. Introduction

The previous French quotes by the Mathematician Jacques Dixmier, the Physicist Pierre Duhem, and the Philosopher Gaston Bachelard are important to introduce the epistemological context of models that will be developed in the paper. Jacques Diximer refers to Alexander Kirillov seminal idea of coadjoint orbits method to consider Lie group representation model. Pierre Duhem makes comments to the origin of the gap between the theory of heat and the theory of Mechanics. Finally, Gaston Bachelard make prediction that new Thermodynamics foundations will be given by groups. We will try in this paper, to prove that these ideas could be reconciled by the Souriau model of Lie groups Thermodynamics through the mathematical structure of Lie algebra cohomology.

After a the state of the art and trends in Machine Learning based on Information Geometry, we will present, in this introduction, the main objective of this paper to jointly apply models from geometric statistical mechanics and tools from Information geometry to solve “Gauss density” definition problem for statistics on Lie groups and homogeneous manifolds. We will also present use-cases motivation for Lie group machine learning illustrating for Doppler statistics analysis with SU(1,1) statistics, and for kinematics data analysis with SE(2) statistics.

1.1. State of the Art and Trends in Machine Learning Based on Information Geometry

The classical simple gradient descent used in Deep Learning has two drawbacks: the use of the same non-adaptive learning rate for all parameter components, and a non-invariance with respect to parameter re-encoding inducing different learning rates. As the parameter space of multilayer networks forms a Riemannian space equipped with Fisher information metric, instead of the usual gradient descent method, the natural gradient or Riemannian gradient method, which takes account of the geometric structure of the Riemannian space, is more effective for learning. The natural gradient preserves this invariance to be insensitive to the characteristic scale of each parameter direction. The Fisher metric defines a Riemannian metric as the Hessian of two dual potential functions (the Entropy and the log-partition function). Yann Ollivier and Gaétan Marceau-Caron provided in 2016 [5] the first experimental results on non-synthetic data sets for the quasi-diagonal Riemannian Natural gradient descents for neural networks introduced previously by Yann Ollivier in [6] (MNIST, SVHN, and FACE data sets). The quasi-diagonal Riemannian algorithms consistently beat simple stochastic gradient gradient descents by a varying margin. The computational overhead with respect to simple backpropagation is around a factor 2, and reach their final performance quickly, thus requiring fewer training epochs and a smaller total computation time. The main goal of natural gradient is to obtain invariance properties, such as, for a neural network, insensitivity of the training algorithm to whether a logistic or tanh activation function is used, or insensitivity to simple changes of variables in the parameters, such as scaling some parameters. In 2017, same authors have introduced the resulting natural Langevin dynamics [7] combining the advantages of natural gradient descent and Fisher-preconditioned Langevin dynamics for large neural networks, validated on MNIST with Fisher matrix preconditioning. With all invariance properties of natural gradient, this Langevin Dynamics avoids overfitting as a regularization method, and replaces classical methods based on a controlled amount of noise to stochastic gradient descents, that ensures convergence to the Bayesian posterior on model parameters. The theoretically optimal covariance of the noise is the inverse Fisher metric, and Y. Ollivier and G. Marceau-Caron have shown how to implement this in practice with neural networks using efficient Fisher metric approximations. In 2017, Yann Ollivier has also introduced TANGO algorithm (True Asymptotic Natural Gradient Optimization) [8], which converges to a true natural gradient descent in the limit of small learning rates, without explicit Fisher matrix estimation, and where in large dimension, small learning rates will be required to approximate the natural gradient well. Y. Ollivier has also shown that it is possible to get arbitrarily close to exact natural gradient descent with a lightweight algorithm. About natural gradient for Deep Learning, we can refer to [9,10]. This year, Shun-ichi Amari [11] has given an elementary geometrical proof that any target function is realized in a sufficiently small neighborhood of any randomly connected deep network, provided the width (the number of neurons in a layer) is sufficiently large.

In this paper, we will introduce how to extend these approaches for data as elements of Lie groups or data lying on a homogeneous manifold where a Lie group acts transitively. This extension is considered in the framework and interconnexion of Souriau “Lie groups Thermodynamics”, Information Geometry and Kirillov representation theory [12] to define probability densities for Lie groups, as Souriau covariant Gibbs densities (density of Maximum of Entropy). We will develop this case for the matrix Lie group SU(1,1) (case with null cohomology) through the computation of Souriau’s moment map, and Kirillov’s orbit method. We will also develop the method for SE(2) Lie group (case with non-null cohomology) where a Souriau cocycle should be taken into account due to the defect of equivariance of the coadjoint action on the moment map.

Supervised learning approaches are based on neural networks whose parameters are estimated by natural gradient algorithms. Non-supervised algorithm are based on clustering by using technics called “k-means” or “Mean-shift” using distance between elements of the dataset. In both cases, if we want to extend these approaches for Lie groups dataset, we have to extend the notion of Gaussian densities and distance between elements. We propose to use Geometric Statistical Model coming from Geometric Statistical Mechanics to introduce “Gauss density” of Lie group elements. Jointly, we can associate a natural distance between these Lie group elements on the Symplectic manifold by means of KKS 2-form, introducing a natural Riemannian metric associated to Fisher Metric from Information Geometry. The objective of this paper is to explain how to use Geometric Statistical Mechanics tools in this context.

1.2. Objectives of this Paper

The purpose of this article is multiple. The work of Professor Jean-Marie Souriau is well known in the field of “Geometric Mechanics” of which he is one of the founders with his book “structure of dynamic systems” published in 1969, and in which he introduced the foundations of Symplectic Geometry. Inside this book, chapter IV dealing with the extension of Geometric Mechanics to Statistical Mechanics, has been little read or misunderstood by this community. We have discovered that this model was part of and generalized another discipline, which is called Information Geometry. We have demonstrated in other previous articles that one could generalize Fisher metric (invariant metric used in Information Geometry) for Lie groups, with this model. It is therefore a question of rehabilitating the work of Jean-Marie Souriau in a broader framework, which concerns statistics and machine learning extended to objects considered as elements of a Lie group or a homogeneous manifold.

The second goal is to solve with these new tools problems that were still unsolved in statistics and machine learning. These unresolved problems concern the definition and calculation of the expression of probability densities, playing the role of Gaussian density, for elements of a Lie group or elements of homogeneous manifolds. In this article, we completely solve the problem for 2 Lie groups very useful in machine learning but also in physics, the Lie groups SU(1,1) and SE(2). The calculation is not a simple application of the Souriau model, because it is necessary to establish the “moment map” associated with these groups and define a Laplace transform on their coadjoint orbits of these groups (action of the group on the dual space of Lie algebra). In a second step, we must use Information Geometry to write these covariant Gibbs densities in the correct parametrization which parametrizes the generalized Gaussian law from statistical moments on the homogeneous symplectic manifold associated with coadjoint orbits. In the case SU(1,1), which corresponds to a case of null cohomology (equivariance of the coadjoint operator on the moment application), as the homogeneous symplectic manifold to the coadjoint orbit is the Poincaré unit disk, we solve jointly, an open problem to define mathematically the notion of Gaussian density in this disk in hyperbolic geometry. With the property that this density is by construction invariant under the action of the group SU(1,1), which is the condition sine qua none to preserve the symmetries and the invariance of the associated Fisher metric. We show that this model achieves a breakthrough in machine learning, because we have a Gibbs density and a Fisher metric invariant by change of parametrization and invariant under the action of symmetries. Gibbs density allows us to extend the classical supervised statistical machine learning algorithms, and Fisher metric allows us to adress unsupervised learning problem as k-means problems in metric space. The model opens the way to machine learning for Lie groups with multiple applications in robotics, sensor signal processing, image processing.

In the last part of the article, based on this model, we give a new “geometric” definition of Entropy by showing that Entropy is an invariant Casimir function in coadjoint representation. The Casimir functions have been widely studied within the framework of Poisson structures and manifolds [13,14,15,16]. This characterization of Entropy is new, because previously Entropy was defined axiomatically. Using this Casimir function property, we show that it is possible to use full geometric approaches to construct the Entropy function only from the structure coefficients of the Lie group associated with the symmetries involved. We show that we can also introduce an Euler-Poincaré equation and its stochastic variant to study other open problems in statistics and thermodynamics. The application of this Casimir characterization, which is demonstrated in this article, are developed in another twin article published in the same special issue with François Gay-Balmaz [17].

1.3. Motivation of Lie Group Machine Learning with Use-Cases

Machine learning is a field of study of artificial intelligence, which is based on statistical approaches to give computers the ability to “learn” from data, that is, to classify data from observations in a supervised or non-supervised way. Machine learning generally has two phases. The first consists in estimating a model from data, called observations. This so-called “training” phase is generally carried out before the practical use of the model. The second phase corresponds to the start of production: the model being determined, new data can then be classified. According to the information available during the learning phase, learning is qualified in different ways. If the data is labeled (that is, the task response is known for that data), it is supervised learning. We speak of classification if the labels are discrete, or of regression if they are continuous. In the most general case, without a label, we seek to determine the underlying structure of the data (which can be a probability density) and it is then unsupervised learning. Machine learning can be applied to different types of data, such as graphs, trees, curves, or more simply feature vectors, which can be continuous or discrete. We propose to extend the approach, when datasets are element of matrix lie groups.

Learning algorithms can be categorized according to their learning mode. For supervised learning, the classes are predetermined and the examples known, and then the system learns to classify according to a classification model. An expert must label examples beforehand. The process takes place in two phases. For unsupervised learning, when the system or operator has only examples, but no label, and the number of classes and their nature have not been predetermined, we speak of unsupervised learning or clustering. No expert is required. The algorithm must discover for itself the more or less hidden structure of the data, by data partitioning and data clustering. The system must cluster the data according to their available attributes, to classify them into homogeneous groups of examples. Similarity is generally calculated according to a distance function between pairs of examples.

We will illustrate two problems of Machine Learning on Lie groups coming from Radar Industry. Target recognition on Radar micro-Doppler data could be modeled by a problem of classification of dataset considered as elements of SU(1,1) Lie group (see Figure 1). Radar complex time series of micro-Doppler observation of data are classically processed on sliding time window to estimate their associated covariance matrices that are characterized by a Toeplitz Hermitian Positive-definitiveness structure. Using a well-known Verblunsky/Trench Theorem, we can parametrize all Toeplitz Hermitian Positive Definite Covariance matrices of stationary Radar Time series in a product space with a real positive axis (for signal power) and a Poincaré polydisk (for Doppler Spectrum shape). If we consider the Poincaré Unit Disk as an homogeneous space where SU(1,1) Lie group acts transitively. Each data in Poincaré unit disk of this polydisk could be then coded by SU(1,1) matrix Lie group element. We have transformed the problem into a statistical learning challenge processing data of SU(1,1) matrix Lie group. Another exemple considers flying object recognition on their kinematics coded in SE(2) or SE(3) Lie Groups. 3D (or 2D) trajectories could be coded by SE(3) (or SE(2)) Lie group time series provided through Invariant Extended Kalman Filter (IEKF) Radar Tracker, that locally estimates displacement of Frenet-Seret frame. Object kinematics will be then coded by time series of SE(3) (or SE(2)) matrix Lie groups characterizing local rotation/translation of Frenet frame along the drone 3D (or 2D) trajectory. Statistics of this SE(3) (or SE(2)) Lie group elements will characterize flight mechanics of different kinds of object (birds, drones, …).

Figure 1.

Figure 1

Illustration of Statistics of Doppler Spectrum as densities of points in Poincaré polydisc where SU(1,1) Lie group act transitively. Top is a time-doppler frequency spectrum of a drone. Bottom is a Doppler spectrum fluctuation coded as densities in Poincaré Polydisk.

SU(1,1) or SE(2) are also fundamental tools in Image Processing (Sub-Riemannian Geometry of vision with SE(2)), in robotics (rigid bodies statistical analysis with SE(2)), in Natural Langage Processing (methods of graph-embedding in Poincaré disk with SU(1,1)), …. For instance, SU(1,1) Lie group which acts on Poincaré unit Disk is highly studied to embed isometrically a graph in an hyperbolic space. It is used by GAFAM (Google, Facebook, …) for Natural Language Processing by reducing graph analysis to a Machine Learning problem in Hyperbolic Poincaré Unit Disk. Hyperbolic Neural network [18] have been developed in this framework. SU(1,1) Lie group is also fundamental in Quantum physics to describe Coherent states of an electron in a magnetic field for instance [19] and Coherent states in Quantum Optics [20] (some statistical photon-counting aspects of SU(1,1) coherent states are emphasized). SE(2) Lie group is especially fundamental for Geometry of Vision considering sub-Riemannian approaches of the Citti-Petitot-Sarti Model [21] but also also in neuroimagery [22].

1.3.1. SU(1,1) Lie Group Machine Learning for Doppler Data Statistics Analysis

Lie group structure appears naturally on Doppler data, if we consider time series of locally stationary signal and their associated covariance matrix. Covariance matrix is Toeplitz Hermitian Positive Definite. Based on Theorem due to Verblunsky [23,24] and Trench [25], we can parametrize Hermitian Positive Definite Matrix in product space involving the Poincaré unit Polydisk:

φ:THDP(n)R+*×Dn1Rn(P0,μ1,,μn1) (1)

where D is the Poincaré Unit Disk:

D={z=x+iyC/|z|<1} (2)

The Poincaré unit disk is an homogeneous bounded domain where the Lie group SU(1,1) act transitively. This Matrix Group is given by:

SU(1,1)={[abb*a*]/|a|2|b|2=1, a,bC} (3)

where SU(1,1) acts on the Poincaré Unit Disk by:

gSU(1,1)g.z=az+bb*z+a (4)

with Cartan Decomposition of SU(1,1)

(abb*a*)=|a|(1zz*1)(a/|a|00a*/|a|)with z=b(a*)1,|a|=(1|z|2)1/2 (5)

We can observe that z=b(a*)1 could be considered as action of gSU(1,1) on the centre on the unit disk z=g.0=b(a*)1. The principal idea is that we can code any point z=b(a*)1 in the unit disk by an element of the Lie group SU(1,1). Main advantage is that the point position is no longer coded by coordinates but intrinsically by transformation from the orogin 0 to this point. Finally, a covariance matrix of a stationary signal could be coded by (n−1) Matrix SU(1,1) Lie group elements:

 THPDR+*×Dn1R+*×SU(1,1)n1Rn(P0,μ1,,μn1)(P0,[a1b1b1*a1*],,[an1bn1bn1*an1*]) (6)

1.3.2. SE(2) and SE(3) Lie Groups Machine Learning for Kinematics Data Statistics Analysis

When we consider a 3D trajectory of a mobile target, we can describe this curve by a time evolution of the local Frenet-Serret frame (local frame with tangent vector, normal vector and binormal vector) as illustrated in Figure 2. This frame evolution is described by the Frenet-Serret formula that gives the kinematic properties of the target moving along the continuous, differentiable curve in 3D Euclidean space ℝ3. More specifically, the formulas describe the derivatives of the so-called tangent, normal, and binormal unit vectors in terms of each other.

ddt(tnb)=[0κ0κ0γ0γ0](tnb) with {κ:curvatureγ:torsion (7)
Figure 2.

Figure 2

Illustration of times series of local Frenet-Serret Frame where SE(3) Lie group act between two successive frames.

We will consider motions determined by exponentials of paths in the Lie algebra. Such a motion is determined by a unit speed space-curve τ(t). Now in a Frenet-Serret motion a point in the moving body moves along the curve and the coordinate frame in the moving body remains aligned with the tangent t, normal n, and binormal b, of the curve. Using the 4-dimensional representation of the Lie group SE(3), the motion can be specified as:

G(t)=(R(t)τ(t)01)SE(3) (8)

where τ(t) is the curve and the rotation matrix R(t) has the unit vectors t, n, and b as columns:

R(t)=(tnb)SO(3) (9)

If we introduce the Darboux vector ω=γt+κb that we can rewritte from Frenet-Serret Formulas:

dtdt=ω×t , dndt=ω×n , dbdt=ω×b (10)

Then, we can write with Ω is the 3 × 3 anti-symmetric matrix corresponding to ω:

dRdt=ΩR (11)

We note that dτ(t)dt=t and dωdt=dγdtt+dκdtb.

The instantaneous twist of the motion G(t) is given by:

Sd=dG(t)dtG1(t)=(Ωυ00) (12)

This is the Lie algebra element corresponding to the tangent vector to the curve G(t). It is well known that elements of the Lie algebra se(3) can be described as lines with a pitch. The fixed axode of a motion G(t)SE(3) is given by the axis of Sd as t varies. The instantaneous twist in the moving reference frame is given by Sb=G1(t)SdG(t), that is, by the adjoint action on the twist in the fixed frame. The instantaneous twist Sb can also be found from the relation:

Sb=G1(t)dG(t)dt (13)
Sb=G1dGdt=(RTRTτ01)(ΩRt00)=(RTΩRRt00) (14)

We can observe that we could describe a 3D trajectory by a time series of SE(3) Lie group elements:

SE(3)={[Rτ01]/RSO(3),τR3} (15)

with

SO(3)={R/RTR=RRT=I,det2R=1} (16)

Then, the trajectory will be given by the following time series of SE(3) elements:

{[R1τ101],[R2τ201],,[Rnτn01]}SE(3)n (17)

2. New Results Introduced in the Paper

The paper is structured in two parts:

  • -

    1st Part on “Gauss Density on Lie groups”: This part is totally new in Machine learning with an extension of “Gauss densities” (defined as Maximum Entropy model) for Lie groups coupling both Souriau model (introduced in statistical physics domain), with Information Geometry in Geometric Machine Learning domain. We illustrate with two use-cases SU(1,1) and SE(2) that are the most useful Lie groups in Image Processing (Sub-Riemannian Geometry of vision with SE(2)), in robotics (rigid bodies statistical analysis with SE(2)), in Natural Langage Processing (SU(1,1) with methods of graph-embedding in Poincaré disk), …. Some tentatives have been developed to define noise on Lie groups by adding additional Gaussian components on elements of the Lie algebra [26,27,28,29], but these models are not mathematically correct because they do not preserve the symmetries and the moment map associated to these symmetries by the Noether Theorem.

  • -

    2nd part on “Entropy definition extension as Casimir Function”: This part gives a new geometric definition of Entropy as invariant Casimir function in coadjoint representation, explaining the invariance of entropy under the affine coadjoint action on moment map in the dual space of Lie algebra. This definition was not in the paper of Souriau. With this new definition, we can compute Entropy only by structure constraints given by the Lie group. It opens the door to new generalization of Maximum Entropy method and first of all computation of “Gaussian densities” for any Lie group. Applications of this new property is not developed in this paper but in a twin paper in the same special issue [17]. We refert to M. Gromov papers to consider more geometric structures of Entropy [30,31].

The main new results of this paper are the introduction of “Gauss density” for Lie groups or data on homogeneous space where a Lie groups acts transitively, and the full computation for SU(1,1) Lie group. This group acts transitively on the Poincaré unit disk, and so we have also solved an open problem related to Gauss density on this homogeneous space. For this purpose, the main approach has considered an extended definition of classical “Gauss density”, as introduced by Jaynes, in term of density of Maximum Entropy. In this way, the initial problem was transfert to a new one related to the good definition of Entropy for Lie groups. To address this problem, first, we have recalled the classical Euclidean case, where the Entropy S(η) could be defined as the Legendre transform of minus the log-partition function Φ(θ)=logReθ,ydy (defined by Laplace transform) by the following equation S(η)=θ,ηΦ(θ) with ηi=Φ(θ)θi and θi=S(η)ηi. The next step was to explain how to extend the log-partition function for Lie groups. We have then considered the Laplace transform in the framework of Lie group representation theory as introduced by Alexander Kirillov and Geometric Statical Mechanics as modeled by Jean-Marie Souriau. We have preserved the same Legendre structure, and have defined the Entropy S(Q), parametrized on the dual space of the Lie algebra Qg* (called geometric heat), as Legendre transform of minus of the log-partition function Φ(β)=logMeJ(ξ),βdλω, parametrized on the Lie algebra by βg (called geometric Planck Temperature), from a Laplace transform defined on the homogeneous symplectic manifold (associated to the Lie group by the Kirrilov-Kostant-Souriau 2-form called KKS 2-form in the litterature). By introducing the moment map J:Mg*, fundamental tool of representation theory introduced by Souriau, we were able to define the log-partition function on the coadjoint orbit of the Lie group, Φ(β)=logg*eJ(ξ),βdλω. The entropy is then given by the Legendre transform S(Q)=Q,βΦ(β) with Q=Φ(β)βg* and β=S(Q)Qg. We have then defined the Gauss density for Lie groups as the density that maximizes this Entropy S(Q) under the constraint of its associated first moment Q=Φ(β)β=MJ(ξ)pGibbs(ξ)dλω. The Gauss density is then established by analogy with thermodynamics as the Gibbs density pGibbs(ξ)=eΦ(β)J(ξ),β=eJ(ξ),βMeJ(ξ),βdλω. But this is not enough, because this density is not given in the good parametrization. We have proposed to express the Gibbs density with respect to the 1st statistical moment Q (statistical mean of moment map) by inverting the relation Q=Φ(β)β=Θ(β). The Gibbs density pGibbs,Q(ξ)=eΦ(β)J(ξ),Θ1(Q) with β=Θ1(Q) will provide the extended definition of Gauss density in final good parametrization.

For the time being, no “Gaussian density” was defined on Poincaré unit disk with the mandatory property to be covariant under the action of SU(1,1) Lie group that acts transitively on this homogeneous bounded domain. We have applied the previous model via computation of moment map and developed the full computation of this extended Gauss density for SU(1,1) Lie group, SU(1,1)={(abb*a*)/a,bC, |a|2|b|2=1} and then deduced as consequence the gauss density for the Poincaré unit disk considered as the homogeneous symplectic manifold associated to the coadjoint orbit of the SU(1,1) Lie group via KKS 2 form. Considering the Lie algebra su(1,1)={(irηη*ir)/rR,ηC} and the dual space of the Lie algebra su(1,1)*={(zx+iyx+iyz)/x,y,zR}, we have computed the moment map J:Dsu*(1,1) defined by J(z).ui=Ji(z,z*), that maps D the Poincaré unit disk into a coadjoint orbit in su*(1,1), J(z)=J1(z,z*)u1*+J2(z,z*)u2*+J3(z,z*)u3*=ρ(1+|z|21|z|22z*1|z|22z1|z|21+|z|21|z|2)g* The moment map J is a diffeomorphism of D onto one sheet of the two-sheeted hyperboloid in su*(1,1), determined by the following equation J12J22J32=ρ2 , J1ρ with J1u1*+J2u2*+J3u3*su*(1,1). But the full SU(1,1) Lie group is not related to any equilibrium Gibbs state (the open subset of the Lie algebra, associated to this Gibbs state is empty). We have then considered one-parameter subgroups of the Lie group SU(1,1) such that the open subset Λβ={βg/DeJ(z),βdλ(z)<+}  is not empty. In the neighborhood of the identity element, the elements of gSU(1,1) can be written as the exponential of an element β of its Lie algebra. If we make the remark that we have the following relation β2=(irηη*ir)(irηη*ir)=(|η|2r2)I, we can developed the exponential map by a Taylor expansion of the exponential function, which is given by the following relation g=exp(εβ)=k=0(εβ)kk!=(cosh(εR)+irsinh(εR)Rηsinh(εR)Rη*sinh(εR)Rcosh(εR)irsinh(εR)R)with R2=|η|2r2.

We can observe that one condition is that |η|2r2>0 then the subset to consider is given by the subset Λβ={β=(irηη*ir),rR,ηC/|η|2r2>0}  such that DeJ(z),βdλ(z)<+. Finally, we have computed the covariant Gibbs density in the unit disk given by βΛβ  and by the moment map of the Lie group SU(1,1), that could be expressed in the following equation: pGibbs(z)=eJ(z),βDeJ(z),βdλ(z)==eρ(1+|z|2(1|z|2)2z*(1|z|2)2z(1|z|2)1+|z|2(1|z|2)),(irηη*ir)DeJ(z),βdλ(z) with dλ(z)=2iρdzdz*(1|z|2)2. To write the final Gibbs density with respect to its statistical moment, we rewrite the density with Q=E[J(z)], by β=Θ1(Q)g where Q=Φ(β)β=Θ(β)g* and Q=E[J(z)]=E[ρ(1+|w|2(1|w|2)2w*(1|w|2)2w(1|w|2)1+|w|2(1|w|2))].

To extend this approach for covariant Gibbs density on Siegel Unit Disk SD={ZMpq(C)/IpZZ+>0}, that is a classical matrix extension of Poincaré unit Disk, we have proposed to consider G=SU(p,q) unitary group and the homogeneous space G/K=SU(p,q)/S(U(p),U(q)) with K=S(U(p)×U(q))={(A00D)/AU(p),DU(q),det(A)det(D)=1} and the moment map given by J(Z)=iλ((IpZZ+)1(pZZ+qIp)(p+q)Z(IqZ+Z)1(p+q)(IqZ+Z)1Z+(pIq+qZ+Z)(IqZ+Z)1).

After SU(1,1) Lie group (case with null cohomology), we have considered the same model for SE(2) Lie group with non-null cohomology that needs the use of symplectic one-cocycle to manage the defect of cohomology. We have considered the special Euclidean group SE(2)={[Rφτ01]/RφSO(2),τR2} with SO(2)={Rφ=[cosφsinφsinφcosφ]/φR}, and the Lie algebra se(2) of SE(2) (ξ,u)se(2)=R×R2[ξu00]se(2) with =[0110], to define the moment map J(ξ,u)(x):R2se*(2) that is given by the expression J(ξ,u)(x)=J(x).(ξ,u) with J(x)=2(12x2,x), xR2. Then, the Gibbs density is deduced for generalized temperature βΩ={(b,Β)se(2)/b<0,ΒR2} by pGibbs(x)=eJ(x),βR2eJ(x),βdλ(x)=e12bx2Β.xR2e12bx2Β.xdλ(x), with the log-partition function given by the following expression Φ(β)=logR2e12bx2Β.xdλ(x)=log(2πbe12bB2) with Q=Φ(β)β=(1bΒ22b2,1bΒ)=Θ(β) and where QΩ*={(m,M)se*(2)/m+M22<0}. To obtain the good parametrization related to statical moments, we have inverted the relation β=Θ1(Q)=((m+12M2)1,(m+12M2)1M), to provide the covariant Gibbs density parametrized by (m,M)=E(J(x))=E[2(12x2,x)]=[E(x2),2E(x)]. The final Gauss density for SE(2) is then pGibbs(x)=e12x2M.x(m+12M2)R2e12x2M.x(m+12M2)dλ(x).

We conclude the paper by a deeper study of Souriau model structure. We observe that Souriau Entropy S(Q) defined on coadjoint orbit of the group has a property of invariance S(Adg#(Q))=S(Q) with respect to Souriau affine definition of coadjoint action Adg#(Q)=Adg*(Q)+θ(g) where θ(g) is called the Souriau cocyle. In the framework of Souriau Lie groups Thermodynamics, we can then characterize the Entropy as a generalized Casimir invariant function in coadjoint representation, and Massieu characteristic function (or log-partition function), dual of Entropy by Legendre transform, as a generalized Casimir function in adjoint representation. When M is a Poisson manifold, a function on M is a Casimir function if and only if this function is constant on each symplectic leaf (the non-empty open subsets of the symplectic leaves are the smallest embedded manifolds of M which are Poisson submanifolds) [15]. Classically, the Entropy is defined axiomatically as Shannon or von Neumann Entropies without any geometric structures constraints. In this paper, the Entropy is also presented as solution of the Casimir equation (adSQ*Q)j+Θ(SQ)j=Cijkad(SQ)i*Qk+Θj=0 with Θ˜(X,Y)=Θ(X),Y=J[X,Y]{JX,JY}=dθ(X),Y, X,Yg, where Θ(X)=Teθ(X(e)) appears in case of non-null cohomology (non-equivariance of coadjoint operator on the moment map), with θ(g) the Souriau Symplectic cocycle. The dual space of the Lie algebra foliates into coadjoint orbits that are also the level sets on the entropy. The KKS (Kostant-Kirillov Souriau) 2-form, and the Souriau-Koszul-Fisher metric transform each orbit into a homogeneous Symplectic manifold. The information manifold foliates into level sets of the entropy that could be interpreted in the framework of Thermodynamics by the fact that motion remaining on this complex surfaces is non-dissipative, whereas motion transversal to these surfaces is dissipative, where the dynamic is given by dQdt={Q,H}Θ˜=adHQ*Q+Θ(HQ) with stable equilibrium when H=SdQdt={Q,S}Θ˜=adSQ*Q+Θ(SQ)=0. We have finally also observed that dS=Θ˜β(HQ,β)dt where Θ˜β(HQ,β)=Θ˜(HQ,β)+Q,[HQ,β] , showing that Entropy production is linked with Souriau tensor related to Fisher metric.

The Casimir equations that we have introduced in non-zero cohomology case are consequences of the constancy of the entropy on adjoint orbits of the Lie algebra and of the equivariance of the map between the set of generalized temperatures and the dual space of the Lie algebra, as introduced by Jean-Marie in his 1974 paper. We explained this fact in the paper by starting elaboration of Casimir equations from the Souriau equation. Casimir equations are then presented in this context, as a fully equivalent form written in a new way, especially in the framework of Souriau Lie groups Thermodynamics. Souriau has not observed that the Entropy is an invariant Casimir function in coadjoint representation, but we can assume that he was fully aware of this invariant structure.

From Souriau equation Q,[β,Z]+Θ˜(β,Z)=0 published in 1974, we have rewritten as direct consequence this equation on a Casimir form adSQ*Q+Θ(SQ)=0. This equation preserves the geometric structures included in Souriau equation but allow us to consider the Entropy from the point of view of Casimir invariant function. The concept of Entropy and the concept of Casimir function were, for the time being, two disjoint concepts that have been developed independently in the past. There is a large literature on Casimir function, especially the russian one that have characterized properties of Casimir function. We refer to Igor V. Shirokov who has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. With Oleg L. Kurnyavko, Igor V. Shirokov has also proposed a general method for constructing invariant Casimir functions. The second reference is about A.T. Fomenko and V.V. Trofimov who have also deeply studied Casimir functions (but in case of null cohomology) and have developed the following equation that we can write for Entropy in null cohomology case S(Adetξ*Q)=S(Q)+n=1(ϕ(ξ))nSn!(Q).tn with ϕ:gVec(Γ) a representation of Lie algebras defined on basis (e1,e2,,en) in g. We refer to a twin paper [17] developing consequences of this new definition of Entropy as an invariant Casimir function. In this twin paper, we study the associated Euler-Poincaré equation dQdt=adHQ*Q+Θ(HQ) and the stochastic extension based on a new Stratonovich differential equation for the stochastic process given by the following relation by mean of Souriau’s symplectic cocycle dQ+[adHQ*Q+Θ(HQ)]dt+i=1N[adHiQ*Q+Θ(HiQ)]dWi(t)=0. These kind of stochastic equations have been also studied by Alexis Arnaudon and Daryl Holm but only in the restricted case of null-cohomology [32].

We give references from classical textbooks (as Souriau book and papers) to preprints because different approaches have been developed in parallel to address Lie groups statistics, as soon as mid of last century, but without bridges between these disciplines which have developed specific tools to address this problem. We have limited these references to main and important documents, which are characterized as seminal and as tutorial of their domains. We have preserved references in French, because some works as Souriau Lie groups Thermodynamics model have not been yet largely spread towards the different communities.

3. Learning Inference Lie Groups Thermodynamics and Covariant Gibbs Density

We identify the Riemanian metric introduced by Souriau based on cohomology, in the framework of “Lie groups thermodynamics” as an extension of classical Fisher metric introduced in information geometry. We have observed that Souriau metric preserves Fisher metric structure as the Hessian of the minus logarithm of a partition function, where the partition function is defined as a generalized Laplace transform on a sharp convex cone. Souriau’s definition of Fisher metric extends the classical one in case of Lie groups or homogeneous manifolds. Souriau has developed this “Lie groups thermodynamics” theory in the framework of homogeneous symplectic manifolds in geometric statistical mechanics for dynamical systems, but as observed by Souriau, these model equations are no longer linked to the symplectic manifold but equations only depend on the Lie group and the associated cocycle [33,34]. This analogy with Fisher metric opens potential applications in machine learning, where the Fisher metric is used in the framework of information geometry, to define the “natural gradient” tool for improving ordinary stochastic gradient descent sensitivity to rescaling or changes of variable in parameter space. In machine learning revised by natural gradient of information geometry, the ordinary gradient is designed to integrate the Fisher matrix. Amari has theoretically proved the asymptotic optimality of the natural gradient compared to classical gradient. With the Souriau approach, the Fisher metric could be extended, by Souriau-Fisher metric, to design natural gradients for data on homogeneous manifolds. Information geometry has been derived from invariant geometrical structure involved in statistical inference. The Fisher metric defines a Riemannian metric as the Hessian of two dual potential functions, linked to dually coupled affine connections in a manifold of probability distributions. With the Souriau model, this structure is extended preserving the Legendre transform between two dual potential function parametrized in Lie algebra of the group acting transentively on the homogeneous manifold.

3.1. Inference by Natutal Gradient and Legendre Structure

Classically, to optimize the parameter θ of a probabilistic model, based on a sequence of observations yt, is an online gradient descent:

θtθt1ηtlt(yt)Tθ (18)

with learning rate ηt, and the loss function lt=logp(yt/y^t). This simple gradient descent has a first drawback of using the same non-adaptive learning rate for all parameter components, and a second drawback of non invariance with respect to parameter re-encoding inducing different learning rates. Amari has introduced the natural gradient to preserve this invariance to be insensitive to the characteristic scale of each parameter direction. The gradient descent could be corrected by I(θ)1 where I is the Fisher information matrix with respect to parameter θ, given by:

I(θ)=[gij] with gij=[Eyp(y/θ)[2logp(y/θ)θiθj]]ij (19)

with natural gradient:

θtθt1ηtI(θ)1lt(yt)Tθ (20)

Amari has proved that the Riemannian metric in an exponential family is the Fisher information matrix defined by:

gij=[2Φθiθj]ij with Φ(θ)=logReθ,ydy (21)

and the dual potential, the Shannon entropy, is given by the Legendre transform:

S(η)=θ,ηΦ(θ) with ηi=Φ(θ)θi and θi=S(η)ηi (22)

We can observe that Φ(θ)=logReθ,ydy=logψ(θ) is linked with the cumulant generating function.

J.L. Koszul and E. Vinberg have introduced an affinely invariant Hessian metric on a sharp convex cone through its characteristic function:

ΦΩ(θ)=logΩ*eθ,ydy=logψΩ(θ) with θΩ sharp convex coneψΩ(θ)=Ω*eθ,ydy with Koszul-Vinberg Characteristic function (23)

Jean-Louis Koszul has introduced the following forms

1st Koszul form:

α=dΦΩ(θ)=dlogψΩ(θ) (24)

2nd Koszul form:

γ=Dα=DdlogψΩ(θ) (25)

with the following property of positive definitiveness:

(DdlogψΩ(x))(u)=1ψΩ(u)2[Ω*F(ξ)2dξ.Ω*G(ξ)2dξ(Ω*F(ξ).G(ξ)dξ)2]>0with F(ξ)=e12x,ξ and G(ξ)=e12x,ξu,ξ (26)

Koszul has defined the following Diffeomorphism:

η=α=dlogψΩ(θ)=Ω*ξpθ(ξ)dξ with pθ(ξ)=eξ,θΩ*eξ,θdξ (27)

with preservation of Legendre transform:

SΩ(η)=θ,ηΦΩ(θ) with η=dΦΩ(θ) and θ=dSΩ(η) (28)

3.2. Souriau Lie Groups Thermodynamique and Souriau-Koszul-Fisher Metric

This relations have been extended by Jean-Marie Souriau in geometric statistical mechanics, where he developed a “Lie groups thermodynamics” of dynamical systems where the (maximum entropy) Gibbs density is covariant with respect to the action of the Lie group. In the Souriau model, previous structures of information geometry are preserved:

I(β)=2Φβ2 with Φ(β)=logMeU(ξ),βdλω and U:Mg* (29)
S(Q)=Q,βΦ(β) with Q=Φ(β)βg* and β=S(Q)Qg (30)

In the Souriau Lie groups thermodynamics model, β is a “geometric” (Planck) temperature, element of Lie algebra g of the group, and Q is a “geometric” heat, element of the dual space of the Lie algebra g* of the group. Souriau has proposed a Riemannian metric that we have identified as a generalization of the Fisher metric:

I(β)=[gβ] with gβ([β,Z1],[β,Z2])=Θ˜β(Z1,[β,Z2]) (31)
with Θ˜β(Z1,Z2)=Θ˜(Z1,Z2)+Q,adZ1(Z2)  where adZ1(Z2)=[Z1,Z2] (32)

Souriau has proved that all co-adjoint orbit of a Lie group given by OF={Adg*F=g1Fg,gG}subset of g*,Fg* carries a natural homogeneous symplectic structure by a closed G-invariant 2-form. If we define K=Adg*=(Adg1)* and K*(X)=(adX)* with Adg*F,Y=F,Adg1Y,gG,Yg,Fg* where if Xg, Adg(X)=gXg1g, the G-invariant 2-form is given by the following expression σΩ(adXF,adYF)=BF(X,Y)=F,[X,Y],X,Yg. Souriau Foundamental Theorem is that « Every symplectic manifold on which a Lie group acts transitively by a Hamiltonian action is a covering space of a coadjoint orbit ». We can observe that for Souriau model, Fisher metric is an extension of this 2-form in non-equivariant case gβ([β,Z1],[β,Z2])=Θ˜(Z1,[β,Z2])+Q,[Z1,[β,Z2]] .

The Souriau additional term Θ˜(Z1,[β,Z2]) is generated by non-equivariance through Symplectic cocycle. The tensor Θ˜ used to define this extended Fisher metric is defined by the moment map J(x), application from M (homogeneous symplectic manifold) to the dual space of the Lie algebra g*, given by:

Θ˜(X,Y)=J[X,Y]{JX,JY} (33)
with J(x):Mg* such that JX(x)=J(x),X, Xg (34)

This tensor Θ˜ is also defined in tangent space of the cocycle θ(g)g* (this cocycle appears due to the non-equivariance of the coadjoint operator Adg*, action of the group on the dual space of the lie algebra; the action of the group on the dual space of the Lie algebra is modified with a cocycle so that the momentu map becomes equivariant relative to this new affine action):

Q(Adg(β))=Adg*(Q)+θ(g) (35)

θ(g)g* is called nonequivariance one-cocycle, and it is a measure of the lack of equivariance of the moment map.

Θ˜(X,Y):g×gwith Θ(X)=Teθ(X(e))X,YΘ(X),Y (36)

The cocycle should verify:

θ(st)=J((st).x)Adst*J(x)θ(st)=[J(s.(t.x))Ads*J(t.x)]+[Ads*J(t.x)Ads*Adt*J(x)]θ(st)=θ(s)+Ads*[J(t.x)Adt*J(x)]θ(st)=θ(s)+Ads*θ(t) (37)

We can also compute tangent of one-cocycle θ at neutral element, to compute 2-cocycle Θ:

ζg,θζ(s)=θ(s),ζ=J(s.x),ζAds*J(x),ζ=J(s.x),ζJ(x),Ads1ζTeθζ(ξ)=TxJ.ξp(x),ζ+J(x),adξζ with ξp=XJ,ξTeθζ(ξ)=XJ(x),ξ[J(x),ζ]+J(x),[ξ,ζ]Teθζ(ξ)={J,ξ,J,ζ}+J(x),[ξ,ζ]=Θ(ξ) (38)

We can also write: TxJ(ξp(x))=adξ*J(x)+Θ(ξ,.)

By differentiating the equation on affine action, we have:

dJ(Xx)=adXJ(x)+dθ(X) , xM,Xg (39)
dJ(Xx),Y=adXJ(x),Y+dθ(X),Y, xM,X,YgdJ(Xx),Y=J(x),[X,Y]+dθ(X),Y={J,X,J,Y}(x) J(x),[X,Y]{J,X,J,Y}(x)=dθ(X),Y (40)

It can be then deduced that the tensor could be also written:

Θ˜(X,Y)=J[X,Y]{JX,JY}=dθ(X),Y , X,Yg (41)

with the cocycle property:

Θ˜([X,Y],Z)+Θ˜([X,Y],Z)+Θ˜([X,Y],Z)=0 , X,Y,Zg (42)

By noting the action of the group on the dual space of the Lie algebra:

G×g*g*,(s,ξ)sξ=Ads*ξ+θ(s) (43)

Associativity is also derived:

(s1s2)ξ=Ads1s2*ξ+θ(s1s2)=Ads1*Ads2*ξ+θ(s1)+Ads1*θ(s2)(s1s2)ξ=Ads1*(Ads2*ξ+θ(s2))+θ(s1)=s1(s2ξ) , s1,s2G,ξg* (44)

This study of the moment map J equivariance, and the existence of an affine action of G on g*, whose linear part is the coadjoint action, for which the moment J is equivariant, is at the cornerstone of Souriau theory of geometric mechanics and Lie groups thermodynamics.

3.3. Souriau Entropy and Souriau-Fisher-Koszul Metric Invariance under the Action of the Group and Covariant Souriau Gibbs Density

In Souriau’s Lie groups thermodynamics, the invariance by re-parameterization in information geometry has been replaced by invariance with respect to the action of the group. When an element of the group g acts on the element βg of the Lie algebra, given by adjoint operator Adg. Under the action of the group Adg(β), the entropy S(Q) and the Fisher metric I(β) are invariant:

βgAdg(β){S[Q(Adg(β))]=S(Q)I[Adg(β)]=I(β) (45)

In the framework of Lie group action on a symplectic manifold, equivariance of moment map could be studied to prove that there is a unique action a(.,.) of the Lie group G on the dual g* of its Lie algebra for which the moment map J is equivariant, that means for each xM:

J(Φg(x))=a(g,J(x))=Adg*(J(x))+θ(g) (46)

When coadjoint action is not equivariant, the symmetry is broken, and new “cohomological” relations should be verified in Lie algebra of the group. A natural equilibrium state will thus be characterized by an element of the Lie algebra of the Lie group, determining the equilibrium temperature β. The entropy s(Q), parametrized by Q the geometric heat (mean of energy U, element of the dual space of the Lie algebra) is defined by the Legendre transform of the Massieu potential Φ(β) parametrized by β (Φ(β) is the minus logarithm of the partition function ψΩ(β)).

A Gibbs state, in the usual sense, is a statistical state at which the entropy is stationary with respect to all infinitesimal variations of the statistical state for which the mean value of the energy remains constant. In the sense of Souriau, a generalized Gibbs state is a statistical state at which the entropy is stationary with respect to all infinitesimal variations of the statistical state for which the mean value of the moment map remains constant. This generalization is very natural, since the energy can be considered as the moment map of the Hamiltonian action of the one-dimensional Lie group of time translations. Furthermore, each generalized Gibbs state is associated to an element of the Lie algebra of the group, called by Souriau a generalized temperature, and that the set of possible generalized temperature is not, in general the whole Lie algeba, but an open convex subset of the Lie algebra, which may be empty, for which some integrals encountered in the expression of the generalized Gibbs state are normally convergent. So, for some Lie groups, generalized Gibbs states do not exist, and there is no Souriau Lie groups thermodynamics.

Souriau has then defined a Gibbs density that is covariant under the action of the group:

pGibbs(ξ)=eΦ(β)U(ξ),β=eU(ξ),βMeU(ξ),βdλω , with Φ(β)=logMeU(ξ),βdλωQ=Φ(β)β=MU(ξ)eU(ξ),βdλωMeU(ξ),βdλω=MU(ξ)p(ξ)dλω (47)

We can express the Gibbs density with respect to Q by inverting the relation Q=Φ(β)β=Θ(β). Then pGibbs,Q(ξ)=eΦ(β)U(ξ),Θ1(Q) with β=Θ1(Q). All Souriau equations of Lie groups Thermodynamics are illustrated in Figure 3 and Figure 4.

Figure 3.

Figure 3

Fondamental Equation of Souriau Lie groups Thermodynamics. Q is the geometric heat in dual Lie algebra, β is the geometric temperature in Lie algebra.

Figure 4.

Figure 4

Souriau Model of Lie Groups Thermodynamics.

Souriau completed his “geometric heat theory” by introducing a 2-form in the Lie algebra, that is a Riemannian metric tensor in the values of adjoint orbit of β, [β,Z] with Z an element of the Lie algebra. This metric is given for (β,Q):

gβ([β,Z1],[β,Z2])=Θ(Z1),[β,Z2]+Q,[Z1,[β,Z2]] (48)

where Θ is a cocycle of the Lie algebra, defined by Θ=Teθ with θ a cocycle of the Lie group defined by θ(M)=Q(AdM(β))AdM*Q.

We observe that Souriau Riemannian metric, introduced with symplectic cocycle, is a generalization of the Fisher metric, that we call the Souriau-Fisher metric, that preserves the property to be defined as a Hessian of the partition function logarithm gβ=2Φβ2=2logψΩβ2 as in classical information geometry. We will establish the equality of two terms, between Souriau definition based on Lie group cocycle Θ and parameterized by “geometric heat” Q (element of the dual space of the Lie algebra) and “geometric temperature” β (element of Lie algebra) and hessian of characteristic function Φ(β)=logψΩ(β) with respect to the variable β (as illustrated in Figure 5):

gβ([β,Z1],[β,Z2])=Θ(Z1),[β,Z2]+Q,[Z1,[β,Z2]]=2logψΩβ2 (49)

Figure 5.

Figure 5

Souriau-Fisher metric as extension of KKS 2-form in case of non-null Cohomogy.

If we differentiate this relation of Souriau theorem Q(Adg(β))=Adg*(Q)+θ(g), this relation occurs:

Qβ([Z1,β],.)=Θ˜(Z1,[β,.])+Q,Ad.Z1([β,.])=Θ˜β(Z1,[β,.]) (50)
Qβ([Z1,β],Z2.)=Θ˜(Z1,[β,Z2])+Q,Ad.Z1([β,Z2])=Θ˜β(Z1,[β,Z2]) (51)
Qβ=gβ([β,Z1],[β,Z2]) (52)

As the entropy is defined by the Legendre transform of the characteristic function, a dual metric of the Fisher metric is also given by the hessian of “geometric entropy” S(Q) with respect to the dual variable given by Q: 2S(Q)Q2.

For the maximum entropy density (Gibbs density), the following three terms coincide: 2logψΩβ2 that describes the convexity of the log-likelihood function, I(β)=E[2logpβ(ξ)β2] the Fisher metric that describes the covariance of the log-likelihood gradient, whereas I(β)=E[(ξQ)(ξQ)T]=Var(ξ) that describes the covariance of the observables. We can also observe that the Fisher metric I(β)=Qβ is exactly the Souriau metric defined through symplectic cocycle:

I(β)=Θ˜β(Z1,[β,Z2])=gβ([β,Z1],[β,Z2]) (53)

The Fisher metric I(β)=2Φ(β)β2=Qβ has been considered by Souriau as a generalization ofheat capacity”. Souriau called it K the “geometric capacity”.

3.4. Covariant Souriau Gibbs Density and Information Manifold Foliation

R.F. Streater has studied in 1999, Information Geometry for some Lie algebra where for certain unitary representation of a Lie algebra, he has defined the statistical manifold of states as convex cone for which the partition function is finite, making reference to Bogoliubov-Kubo-Mori metric. But Streater has only developed the case with null cohomology for so (3) and sl (2,R) Lie alebras. Nevertheless, as observed by R.F. Streater in his paper “Information Geometry for some Lie algebras” [35], referring to Kirillov work and Roger Balian paper, “We can expect further natural structures to arise in this case. Indeed, it is known (*) that the dual to the Lie algebra, which parametrizes the state-space in this case, foliates into coadjoint orbits; there are also the level sets on the entropy; Kirillov form, and the BKM (Bogoliubov-Kubo-Mori) metric, together make each orbit into kähler space, along the lines proposed by Kostant. Motion along these holomorphic directions is nondissipative. The transversal to the orbits is a real half-line, which represents the dissipative direction…We study the case of sl (2,R) in the discrete series of representations. We show the information manifold foliates into level sets of the entropy, each being isometric to H, the Poincaré upper half-plane… The states of constant entropy are the hyperboloids and β is the dissipative coordinate… For an integrable system described by a Lie algebra in a traceable representation, we find that the information manifold foliates into complex spaces; the level sets of entropy can be given a complex structure by the method of Kostant. Motion remaining on the complex surfaces is nondissipative, whereas motion transversal to these surfaces is dissipative. In information geometry, the state is parametrized by the canonical coordinates. Which function of them is measured by a thermometer? In our models, it is reasonable to designate 1/β to be the temperature; it is a dissipative coordinate, and it increases with time, showing that the system is thermalizing”.

4. Mathematical Definition of Souriau Moment Map

Previously, we have introduced the concept of Souriau’s moment map. In this chapter, we will introduce a mathematical definition of this tool, as defined in Souriau’s book [36] with modern notations [37,38,39,40,41]. Other details on moment map are also given in Jean-Louis Koszul’s Book [42].

4.1. Operations on Vector Fields

Consider a map F:XRMYRN, y=F(x), the derivative of F at xX, DF:XRN×M is given by:

(δy1δyN)=(y1x1y1xMyNx1yNxM)(δx1δxM)=DF(x)(δx)=Limt0F(x+tδx)F(x)t (54)

Second derivative is given by the linear map D2F:XRN×M×M:

δ[yx]=2yx2(δx)=D2F(x)(δx) (55)

Consider a vector Field V on XRM defined by: V:XRMRM, operations on vector fields are given by adjoint action and Lie bracket:

AdFV(y)=ddt[FetVF1](y)|t=0=DF(x)(V(x)) with x=F1(y) (56)
[U,V](x)=ddsAdesUV(x)|s=0=DU(x)(V(x))DV(x)(U(x)) (57)

0-form is a scalar, 1-form are row ω=(ω1ωM) in dual space. 2-forms can be regarded as antisymmetric matrices (ωij) with ω(u,v)=ut(ω11ω1MωM1ωMM)v. m-forms are all scalar multiples of the standard volume form vol, defined by Vol(v1,,vm)=det(matrix with columns v1,,vm).

4.2. Derivative Rules by Sophus Lie, Elie Cartan and Henri Cartan

With the following classical definitions:

  • Pull back: F*ω is a p-form on X
    F*ω(v1,,vp)=ωF(x)(DF(x)(v1),,DF(x)(vp)) (58)
  • Interior product: iVω is the (p−1)form on M obtained by inserting V(x) as the first argument of ω
    iVω(v2,vp)=ω(V(x),v2,,vp) (59)
  • Exterior product:θω is the (p + 1)-form on X where ω is a p-form and θ is a 1-form on M (where the hat indicates a term to be omitted):
    θω(v0,,vp)=i=0p(1)iθ(vi)ω(v0,,v^i,,vp) (60)
  • Lie derivative: LVω is a p-form on M, and LVω=0 if the flow of V consists of symmetries of ω:
    LVω(v1,,vp)=ddtetV*ω(v1,,vp)|t=0 (61)
    dω is the (p+1)-form on M defined by taking the ordinary derivative of ω and then antisymmetrizing:
  • Exterior derivative:
    dω(v0,,vp)=i=0p(1)iωx(vi)(v0,,v^i,,vp) (62)
    p=0,[dω]i=iω ; p=1,[dω]ij=iωjjωi ; p=2,[dω]ijk=iωjk+jωki+kωij (63)

From these definitions, the properties of the exterior and Lie Derivative were established by Sophus Lie, Elie Cartan, and Henri Cartan:

LVω=diVω+iVdω (64)

(Elie Cartan equation)

i[U,V]ω=iVLUωLUiVω (65)

(Henri Cartan equation)

L[U,V]ω=LVLUωLULVω (66)

(Sophus Lie equation)

4.3. Souriau Moment Map

Considering Manifolds and Lie groups, We define the tangent bundle TX of X as the disjoint union of the TxX, or the set of all pairs (δxx) with xX and δxTxX. If F:XY is a smooth map between manifolds, its tangent map is the map:

F*(δxx)=(DF(x)(δx)F(x)) (67)

A Lie group is a group G with a manifold structure such that the product (g,h)gh and the inversion gg1 are smooth maps from G×G (resp. G) to G. Its Lie algebra is the tangent space g=TeG at the identity element. A smooth action of G on a manifold X is a group morphism:

Φ:G×XDiff(X)(g,x)g.x (68)

The orbit of xX is G(x)={g.x:gG}.

The tangent space to an orbit at x:

TxG(x)={Z(x):Zg}=g/gx with Z(x)=ddtetZ(x)|t=0 and where

gx={Zg:Z(x)=0} (69)

Let (M,σ) be a connected symplectic manifold. A vector field η on M is called symplectic if its flow preserves the 2-form: Lησ=0. If we use Elie Cartan’s formula, we can deduce that Lησ=diησ+iηdσ=0 but as dσ=0 then diησ=0. We observe that the 1-form iησ is closed. When this 1-form is exact, there is a smooth function xH on M with:

iησ=dH (70)

This vector field η is called Hamiltonian and could be defined as symplectic gradient η=SympH.

Let a Lie group G that acts on M and that also preserve σ. A moment map exists if these infinitesimal generators are actually hamiltonian, so that a map J:Mg* exists with:

iZXσ=dHZ where

HZ=J(x),Z (71)

The Poisson bracket of two functions H, H is defined by:

{H,H}=σ(η,η)=σ(SympH,SympH) with iησ=dH andiησ=dH (72)

If G is connected, then the moment map is G-equivariant if and only if it satisfies {HZ,HZ}=H[Z,Z].

Souriau has proved thet every coadjoint orbit of a Lie group is a homogeneous symplectic manifold when endowed with the KKS 2-form σ(Z(x),Z(x))=x,[Z,Z], and conversely, every homogeneous symplectic manifold of a connected Lie group G is, up to a possible covering, a coadjoint orbit of some central extension of G. σ is G-invariant.

5. Poincaré Unit Disk, SU(1,1) Lie Group and Souriau Moment Map

We will introduce Souriau moment map for SU(1,1)/K group that acts transitively on Poincaré Unit Disk, based on moment map. More details on computation of moment map for SU(1,1)/K Lie group is given in Appendix A of this document.

5.1. Poincaré Unit Disk and SU(1,1) Lie Group

The group of complex unimodular pseudo-unitary matrices SU(1,1), is the set of elements u such that [43,44,45,46,47,48,49,50,51,52]:

uMu+=M with M=(+1001) (73)

We can show that the most general matrix u belongs to the Lie group given by:

G=SU(1,1)={(abb*a*)/|a|2|b|2=1, a,bC} (74)

Its Cartan decomposition is given by:

(abb*a*)=|a|(1zz*1)(a/|a|00a*/|a|) with z=b(a*)1,|a|=(1|z|2)1/2 (75)
(abb*a*)(1zz*1)=|a|(1zz*1)(a/|a|00a*/|a|) with {a=bz*+az=az+bb*z+a* (76)

SU(1,1) is associated to group of holomorphic automorphisms of the Poincaré unit disk D={z=x+iyC/|z|<1} in the complex plane, by considering its action on the disk as g(z)=(az+b)/(b*z+a*). The following measure on Unit disk:

dμ0(z,z*)=12πidzdz*(1|z|2)2 (77)

is invariant under the action of SU(1,1) captured by the fractional holomorphic transformation:

dzdz*(1|z|2)2=dzdz*(1|z|2)2 (78)

The complex unit disk admits a Kähler structure determined by potential function:

Φ(z,z*)=log(1zz*) (79)

The invariant 2-form is:

Ω=1i2Φ(z,z*)zz*dzdz*=1idzdz*(1|z|2)2 (80)

which is closed dΩ=0. This group SU(1,1) is isomorphic to the group SL(2,R) as a real Lie group, and the Lie algebra g=𝖘u(1,1) is given by:

g={(irηη*ir)/rR,ηC} (81)

with the bases (u1,u2,u3)g: u1=12(0ii0) , u2=12(0110) , u3=12(i00i)

with the commutation relation:

[u3,u2]=u1,[u3,u1]=u2,[u2,u1]=u3 (82)

Dual base on the dual space of the Lie algebra is named (u1*,u2*,u3*)g*. The dual vector space g*=𝖘u*(1,1) can be identified with the subspace of 𝖘𝖑(2,C) of the form:

g*={(zx+iyx+iyz)=x(0110)+y(0ii0)+z(1001)/x,y,zR} (83)

Coadjoint action of gG on dual space of the Lie algebra ξg* is written g.ξ.

5.2. Coadjoint Orbit of SU(1,1) and Souriau Moment Map

We will use results of C. Cishahayo and S. de Bièvre [53] and B. Cahen [54,55] for computation of moment map of SU(1,1). Let rR*+, orbit O(ru3*) of ru3* for the coadjoint action of gG could be identified with the upper half sheet x3>0 of {ξ=x1u1*+x2u2*+x3u3*/x12x22+x32=r2}, the two-sheet hyperboloid. The stabilizer of ru3* for the coadjoint action of G is torus K={(eiθ00eiθ),θR}. K induces rotations of the unit disk, and leaves 0 invariant. The stabilizer for the origin 0 of unit disk is maximal compact subgroup K of SU(1,1). We can observe [54] that O(ru3*)=G/K. On the other hand O(ru3*)=G/K is diffeomorphic to the unit disk D={zC/|z|<1}, then by composition, the Souriau moment map is given by:

J:DO(ru3*)zJ(z)=r(z+z*(1|z|2)u1*+zz*i(1|z|2)u2*+1+|z|2(1|z|2)u3*) (84)

J is linked to the natural action of G on D (by fractional linear transforms) but also the coadjoint action of G on O(ru3*)=G/K. J1 could be interpreted as the stereographic projection from the two-sphere S2 onto C [56]. In case r=n2 where nN+,n2 then the coadjoint orbit is given by On=O(ζn) with ξn=n2u3*g*, with stabilizer of ξn for coadjoint action the torus K={(eiθ00eiθ),θR} with Lie algebra Ru3. On=O(ζn) is associated with a holomorphic discrete series representation πn of G by the KKS (Kirillov-Kostant-Souriau) method of orbits.

J:DOnzJ(z)=n2(z+z*(1|z|2)u1*+zz*i(1|z|2)u2*+1+|z|2(1|z|2)u3*) (85)

Group G act on D by homography g.z=(abb*a*).z=az+bb*z+a*. This action corresponds with coadjoint action of G on On. The Kirillov-Kostant-Souriau 2-form of On is given by:

Ωn(ζ)(X(ζ),Y(ζ))=ζ,[X,Y] , X,Yg and ζOn (86)

and is associated in the frame by J with:

ωn=in(1|z|2)2dzdz* (87)

with the corresponding Poisson Bracket:

{f,g}=i(1|z|2)2(fzgz*fz*gz) (88)

It has been also observed that there are 3 basic observables generating the SU(1,1) symmetry on classical level:

{DRzk3(z)=1+|z|21|z|2,{DRzk1(z)=1izz*1|z|2,{DRzk2(z)=z+z*1|z|2 (89)

with the Poisson commutation rule:

{k3,k1}=k2,{k3,k2}=k1,{k1,k2}=k3 (90)

(k1,k2,k3) vector points to the upper sheet of the two-sheeted hyperboloid in R3 given by k32k12k22=1, whose the stereographic projection onto the open unit disk is:

{(k1,k2,k3)H+Dz=k2+ik11+k3=k31k3+1eiargz (91)

Under the action of gG=SU(1,1)={(abb*a*)/|a|2|b|2=1, a,bC}:

(kk3k3k+)=(k2+ik1k3k3k2ik1)=11|z|2(2z1+|z|21+|z|22z*) is transform in:

(kk3k3k+)=(k(g1.z)k3(g1.z)k3(g1.z)k+(g1.z))=g1(kk3k3k+)(g1)t (92)

This transform can be viewed as the co-adjoint action of SU(1,1) on the coadjoint orbit identified with k32k12k22=1. We can also observe that the quotient SU(1,1)/K is isomorphic to the upper sheet of the hyperboloid described by k32k12k22=1, by the following parametrization (τ,φ), given by n=(coshτ,sinhτcosφ,sinhτsinφ), and its stereographic projection onto the inside of the unit disk, parametrized by ς=tanhτ2eiφ.

6. Covariant Gibbs Density by Souriau Thermodynamics for Poincaré Unit Disk

6.1. Fourier Transform, Laplace Transform and Lie Group Representation Theory

In Souriau Lie Group Thermododynamic, we have to consider Laplace Transform defined on coadjoint orbits to define Massieu Potential Function and Gibbs density. This problem has been solved in the domain of Kirillov Representation Theory. Representation theory studies abstract algebraic structures by representing their elements as linear transformations of vector spaces, and algebraic objects (Lie groups, Lie algebras) by describing its elements by matrices and the algebraic operations in terms of matrix addition and matrix multiplication, reducing problems of abstract algebra to problems in linear algebra. Representation theory generalizes Fourier analysis via harmonic analysis. The modern development of Fourier analysis during XXth century has explored the generalization of Fourier and Fourier-Plancherel formula for non-commutative harmonic analysis, applied to locally compact non-Abelian groups. This has been solved by geometric approaches based on “orbits methods” (Fourier-Plancherel formula for G is given by coadjoint representation of G in dual vector space of its Lie algebra) with many contributors (Dixmier, Kirillov, Bernat, Arnold, Berezin, Kostant, Souriau, Duflo, Guichardet, Torasso, Vergne, Paradan, etc.) [57,58,59,60,61,62,63,64,65,66,67,68].

For classical commutative harmonic analysis, we consider the following groups:

G=Τn=Rn/Zn for Fourier series, G=Rn for Fourier TransformG group character (linked to eikx):χ:GU with U={zC/|z|=1}G^={χ/χ1.χ2(g)=χ1(g)χ2(g)} and Fourier transform is given by:φ:GCφ^:G^Cgφ(g)=G^φ^(χ)χ(g)1dχχφ^(χ)=Gφ(g)χ(g)dg (93)

For non-commutative harmonic analysis, Group unitary irreductible representation is U:GU(H) with H Hilbert space and character by χU(g)=trUg. Fourier transform for non-commutative group is Uφ=Gφ(g)Ugdg with character χU(g)=trUφ. If we describe group element with exponential map Uψ=gψ(X)Uexp(X)dX, we have:

trUψ=dimτ.μG.f(ψ.j1)ψ.j1:gg*, Four. Transf. with {μG.f: Liouville meas. on O=G.f,fg*μG.f(ψ.j1): Integral of ψ.j1wrt μG.f  (94)

where

j(X)=(dets(adX))1/2 with s(x)=n=01(2n+1)!(x2)2n=sh(x2)/(x2) (95)

Kirillov Character formula is:

χU(exp(X))=trUexp(X)=j(X)1Oeif,XdμO(f) (96)
Oeif,XdμO(f)=j(X)trUexp(X) with j(X)=(det(eadX/2eadX/2adX/2))1/2  (97)

We will use Kirillov representation theory and his character formula to compute Souriau covariant Gibbs density in the unit Poincaré disk. For any Lie group G, a coadjoint orbit Og* has a canonical symplectic form ω0 given by KKS 2-form. As seen, if G is finite dimensional, the corresponding volume element defines a G-invariant measure supported on O, which can be interpreted as a tempered distribution. The Fourier transform (where d is the half of the dimension of the orbit O):

(x)=Og*eiλ,x1d!dωOd with λg* and xg (98)

is Ad G-invariant. When Og* is an integral coadjoint orbit, Kirillov formula, given previously, expresses Fourier transform (x) by Kirillov character χO:

(x)=j(x)χO(ex) where j(x)=det1/2(sinh(ad(x/2))ad(x/2)) (99)

χO is, as defined previously, the “Kirillov character” of a unitary representation associated to the orbit.

6.2. Souriau Covariant Gibbs Density in Poincaré Unit Disk for SU(1,1) Lie Group

In the following, we will give the full development to compute the Souriau covariant Gibbs density. As the Gibbs density is not defined for all geometric temperature, as observed by Souriau, we have used his approach by considering a one-parameter subgroup of the Lie group generated by exponential map from a one element of Lie algebra given by geometric temperature. The subset of Lie algebra where the Gibbs density is deduced from the contraints related to this one-parameter subgroup generation.

Considering the Lie group SU(1,1)={(abb*a*)/a,bC, |a|2|b|2=1} and its Lie algebra given by elements su(1,1)={(irηη*ir)/rR,ηC}. A basis for this Lie algebra su(1,1) is (u1,u2,u3)g with u1=i2(1001),u2=12(0110) and u3=12(0ii0) with [u1,u3]=u2,[u1,u2]=u3,[u2,u3]=u1.

The compact subgroup is generated by u1, while u2 and u3 generate a hyperbolic subgroup. The dual space of the Lie algebra is given by su(1,1)*={(zx+iyx+iyz)/x,y,zR} with the basis (u1*,u2*,u3*)g* with u1*=(1001),u2*=(0ii0) and u3*=(0110).

Let consider D={zC/|z|<1} be the open unit disk of Poincaré. For each ρ>0, the pair (D,ωρ) is a symplectic homogeneous manifold with ωρ=2iρdzdz*(1|z|2)2, where ωρ is invariant under the action: SU(1,1)×DD(g,z)g.z=az+bb*z+a*.

This action is transitive and is globally and strongly Hamiltonian. Its generators are the hamiltonian vector fields associated to the functions:

J1(z,z*)=ρ1+|z|21|z|2 , J2(z,z*)=ρizz*1|z|2 , J3(z,z*)=ρz+z*1|z|2 (100)

The associated moment map J:Dsu*(1,1) defined by J(z).ui=Ji(z,z*), maps D into a coadjoint orbit in su*(1,1). Then, we can write the moment map as a matrix element of su*(1,1):

J(z)=J1(z,z*)u1*+J2(z,z*)u2*+J3(z,z*)u3*=(ρ1+|z|21|z|2ρzz*1|z|2ρz+z*1|z|2ρzz*1|z|2+ρz+z*1|z|2ρ1+|z|21|z|2)J(z)=J1(z,z*)u1*+J2(z,z*)u2*+J3(z,z*)u3*=ρ(1+|z|21|z|22z*1|z|22z1|z|21+|z|21|z|2)g* (101)

The moment map J is a diffeomorphism of D onto one sheet of the two-sheeted hyperboloid in su*(1,1), determined by J12J22J32=ρ2 , J1ρ with J1u1*+J2u2*+J3u3*su*(1,1). We note Oρ+ the coadjoint orbit AdSU(1,1)* of SU(1,1), given by the upper sheet of the two-sheeted hyperboloid given by previous equation. The orbit method of Kostant-Kirillov-Souriau associates to each of these coadjoint orbits a representation of the discrete series of SU(1,1), provided that ρ is a half integer greater or equal than 1 (ρ=k2,kN and ρ1). When explicitly executing the Kostant-Kirillov construction, the representation Hilbert spaces Hρ are realized as closed reproducing kernel subspaces of L2(D,ωρ). The Kostant-Kirillov-Souriau orbit method shows that to each coadjoint orbit of a connected Lie group is associated a unitary irreducible representation of G acting in a Hilbert space H.

Souriau has oberved that action of the full Galilean group on the space of motions of an isolated mechanical system is not related to any equilibrium Gibbs state (the open subset of the Lie algebra, associated to this Gibbs state is empty). The main Souriau idea was to define the Gibbs states for one-parameter subgroups of the Galilean group. We will use the same approach, in this case We will consider action of the Lie group SU(1,1) on the symplectic manifold (M,ω) (Poincaré unit disk) and its momentum map J are such that the open subset Λβ={βg/DeJ(z),βdλ(z)<+}  is not empty. This condition is not always satisfied when (M, ω) is a cotangent bundle, but of course it is satisfied when it is a compact manifold. The idea of Souriau is to consider a one parameter subgroup of SU(1,1). To parametrize elements of SU(1,1) is through its Lie algebra. In the neighborhood of the identity element, the elements of gSU(1,1) can be written as the exponential of an element β of its Lie algebra:

g=exp(εβ) with βg (102)

The condition g+Mg=M for M=(1001) can be expanded for ε<<1 and is equivalent to β+M+Mβ=0 which then implies β=(irηη*ir),rR,ηC. We can observe that r and η=ηR+iηI contain 3 degrees of freedom, as required. Also because detg=1, we get Tr(β)=0. We can then exponentiate β with exponential map to get:

g=exp(εβ)=k=0(εβ)kk!=(aε(β)bε(β)bε*(β)aε*(β)) (103)

If we make the remark that β2=(irηη*ir)(irηη*ir)=(|η|2r2)I, we can developed the exponential map:

g=exp(εβ)=(cosh(εR)+irsinh(εR)Rηsinh(εR)Rη*sinh(εR)Rcosh(εR)irsinh(εR)R) with R2=|η|2r2 (104)

We can observe that one condition is that |η|2r2>0 then the subset to consider is Λβ={β=(irηη*ir),rR,ηC/|η|2r2>0}  such that DeJ(z),βdλ(z)<+. The generalized Gibbs states of the full SU(1,1) group do not exist. However, generalized Gibbs states for the one-parameter subgroups exp(αβ), βΛβ, of the SU(1,1) group do exist. The generalized Gibbs state associated to β remains invariant under the restriction of the action to the one-parameter subgroup of SU(1,1) generated by exp(εβ).

To go futher, we will develop the Souriau Gibbs density from the Souriau moment map J(z) and the Souriau temperature βΛβ . If we note b=11|z|2[1z], we can write the moment map:

J(z)=ρ(1+|z|21|z|22z*1|z|22z1|z|21+|z|21|z|2)=ρ(2Mbb+Tr(Mbb+)I) with M=[1001] (105)

We can the write the covariant Gibbs density in the unit disk given by moment map of the Lie group SU(1,1) and geometric temperature in its Lie algebra βΛβ:

pGibbs(z)=eJ(z),βDeJ(z),βdλ(z) where dλ(z)=2iρdzdz*(1|z|2)2 (106)
pGibbs(z)=eρ(2bb+Tr(bb+)I),βDeJ(z),βdλ(z)=eρ(1+|z|2(1|z|2)2z*(1|z|2)2z(1|z|2)1+|z|2(1|z|2)),(irηη*ir)DeJ(z),βdλ(z) (107)

To write the Gibbs density with respect to its statistical moments, we have to express the density with respect to Q=E[J(z)]. Then, we have to invert the relation between Q and β, to replace this last variable β=(irηη*ir)Λβ by β=Θ1(Q)g where Q=Φ(β)β=Θ(β)g* with Φ(β)=logDeJ(z),βdλ(z), deduce from Legendre tranform. The mean moment map is given by:

Q=E[J(z)]=E[ρ(1+|w|2(1|w|2)2w*(1|w|2)2w(1|w|2)1+|w|2(1|w|2))] where wD (108)

This mean moment map can be obtained by Karcher mean computation on the one-sheet hyperboloid corresponding to the coadjoint orbit. For the dual pairing, we can observed that J(z)=J1(z,z*)u1*+J2(z,z*)u2*+J3(z,z*)u3*g* with J1(z,z*)=ρ1+|z|21|z|2, J2(z,z*)=ρizz*1|z|2, J3(z,z*)=ρz+z*1|z|2 and β=β1u1+β2u2+β3u3g* with β=2(r,ηR,ηI) , η=ηR+iηI.

The integral of normalization in Gibbs density could be computed through Kirillov character formula by χm(exp((x..x)))=j(x)1Om1+ei(x..x),(irηη*ir)ωOm1+ where

j(x)=det1/2[sinh(ad(x/2x/2))/ad(x/2x/2)]=sinh(x)x

with following relation emx1e2xj(x)=De(m1)x1+|w|21|w|21(1|w|2)2dwdw*.

Recently, Enrico De Micheli [69] has introduced a Laplace-type transform (the so-called Spherical Laplace Transform) with a connection to the Non-Euclidean Fourier Transform in the sense of Helgason, and the principal series of the unitary representation of SU(1,1).

6.3. Extension to SU (p,q) Unitary Group for Siegel Unit Disk

Mode details are given in Appendix B, on parameterization of SU(1,1) and extension to SU (p,q). To address computation of covariant Gibbs density for Siegel Unit Disk, we will consider in this section SU(p,q) Unitary Group:

G=SU(p,q) and K=S(U(p)×U(q))={(A00D)/AU(p),DU(q),det(A)det(D)=1} (109)

We can use the following decomposition for gGC:

g=(ABCD)GC,g=(IpBD10Iq)(ABD1C00D)(Ip0D1CIq) (110)

and consider the action of gGC on Siegel Unit Disk SD={ZMpq(C)/IpZZ+>0} given by:

g=(ABCD)GC,g=(IpBD10Iq)(ABD1C00D)(Ip0D1CIq) (111)

Benjamin Cahen has study this case and introduced the moment map by identifing G-equivariantly g* with g by means of the Killing form β on gC:

g* Gequivariant with gby Killing form β(X,Y)=2(p+q)Tr(XY) 

The set of all elements of g fixed by K is 𝖍:

𝖍={element of G fixed by K} , ξ0𝖍,ξ0=iλ(qIp00pIq)ξ0,[Z,Z+]=2iλ(p+q)2Tr(ZZ+),ZD (112)

Then, we the equivatiant moment map is given by:

XgC , ZD,ψ(Z)=Ad*(exp(Z+)ζ(expZ+expZ))ξ0gG,ZD then ψ(g.Z)=Adg*ψ(Z)ψ is a diffeomorphism from SD onto orbit O(ξ0) (113)

with:

ψ(Z)=iλ((IpZZ+)1(pZZ+qIp)(p+q)Z(IqZ+Z)1(p+q)(IqZ+Z)1Z+(pIq+qZ+Z)(IqZ+Z)1) (114)
ζ(expZ+expZ)=(IpZ(IqZ+Z)10Iq) (115)

7. Lie Groups Thermodynamics for SE(2) Lie Group

After SU(1,1) Lie group with null cohomology and then without Souriau one-cocycle, we will consider Souriau model for SE(2) Lie group with non-null cohomology and then with introduction of Souriau one-cocycle [70].

We will consider first SO(2) Lie group:

SO(2)={Rφ=[cosφsinφsinφcosφ]/φR} (116)

A vector at the identity to SO(2) is given by:

dRtηdt|t=0=η with =[0110],T=1= (117)

We consider the special Euclidean group SE(2)=SO(2)×R2.

SE(2)={[Rφτ01]/RφSO(2),τR2} (118)

the group operation is given by:

[Rφ1τ101][Rφ2τ201]=[Rφ1Rφ2Rφ1τ2+τ101]=[Rφ1+φ2Rφ1τ2+τ101](R1,τ1).(Rφ2,τ2)=(Rφ1+φ2,Rφ1τ2+τ1) (119)
[Rφ1τ101]1=[Rφ1Rφ1τ101](Rφ1,τ1)1=(Rφ1,Rφ1τ1) (120)

The Lie algebra se(2) of SE(2) has underlying vector space R3 and Lie bracket:

(ξ,u)se(2)=R×R2[ξu00]se(2) (121)

Lie bracket is given by:

[(ξ,u),(η,v)]=(0,ξv+ηu) (122)

Adjoint action of SE(2) is given by:

Ad(Rφ,τ)(ξ,u)=[Rφτ01][ξu00][RφRφτ01]=[ξξτ+Rφu00]Ad(Rφ,τ)(ξ,u)=(ξ,Rφu+ξτ) (123)

Coadjoint action of SE(2) is given by:

Ad(Rφ,τ)*(m,ρ)=(m+Rφρ.τ,Rφρ) (124)

The moment map J:R2se*(2) of SE(2) is defined by:

J(ξ,u)(x)=J(x).(ξ,u) (125)

with the right action of SE(2) on R2:

x.(Rφ,τ)=Rφ(xτ) (126)

the infinitesimal generator of (ξ,u)se(2) has the expression:

(ξ,u)R2(x)=d[x.(Rtξ,tu)]dt|t=0=d[Rtξ(xtu)]dt|t=0=ξxu (127)

Let J(ξ,u)(x):R2se*(2) be the moment map of this action relative to the symplectic form, we can compute it from its definition:

dJ(ξ,u)(x).y=2ω((ξ,u)R2,y)with ω((ξ,u)R2,y)=ω(ξxu,y)=(ξxu).y=(ξx+u).ydJ(ξ,u)(x).y=2(ξx+u).yJ(ξ,u)(x)=2(12ξx2+u.x)=2(12x2,x).(ξ,u)J(ξ,u)(x)=J(x).(ξ,u)J(x)=2(12x2,x) , xR2 (128)

We then compute the one-cocycle of SE(2) from the moment map:

θ((Rφ,τ))=J(0.(Rφ,τ))Ad(Rφ,τ)*J(0)=J(Rφτ)θ((Rφ,τ))=2(12τ2,Rφτ)=2(12τ2,Rφπ2τ) (129)

Coadjoint orbit of SE(2) are generated by:

O(m,ρ)={A(Rφ,τ)*(m,ρ)+θ((Rφ,τ))/(Rφ,τ)SE(2)}O(m,ρ)={(xRπ2ρ.ττ2,Rφρ2Rφπ2τ)/(Rφ,τ)SE(2)} (130)

The Souriau Symplectic form in this case of non-null cohomology is given by:

ω(m,ρ)(m,ρ)(ad(ξ,u)*(m,ρ)(0,2u),ad(η,v)*(m,ρ)(0,2v))=ρ.(ξv+ηu)+2u.vwith (m,ρ)=(xRπ2ρ.ττ2,Rφρ2Rφπ2τ)O(m,ρ)R3 (131)

With the expression of moment map, we can compute Souriau covariant Gibbs density of Maximum Entropy.

Considering the symplectic form ω(ζ,υ)=ζ.υ with =[0110] on R2, we have seen that the action of SE(2) is symplectic and admits the momentum map, J(x)=(12x2,x) , xR2.

Souriau Gibbs density is defined for generalized temperature βΩ={(b,Β)se(2)/b<0,ΒR2} and given by:

pGibbs(x)=eJ(x),βR2eJ(x),βdλ(x)=e12bx2Β.xR2e12bx2Β.xdλ(x) (132)

The Massieu Potential could be computed:

Φ(β)=logR2e12bx2Β.xdλ(x)=log(2πbe12bB2) (133)

By derivation of Massieu potential, we can deduce expression of Heat:

QΩ*={(m,M)se*(2)/m+M22<0}Q=Φ(β)β=(1bΒ22b2,1bΒ)=Θ(β) (134)

We can the inverse this relation to express generalized temperature with respect to the heat:

β=Θ1(Q)=((m+12M2)1,(m+12M2)1M) (135)

We can the express the Gibbs density with respect to the Heat Q which is the mean of moment map:

pGibbs(x)=e12x2M.x(m+12M2)R2e12x2M.x(m+12M2)dλ(x) with (m,M)=E(J(x))=E[2(12x2,x)]=[E(x2),2E(x)] (136)

So we can rewrite the Gibbs density:

pGibbs(x)=e12x2+2E(x).Ix(E(x2)+2E(x)2)R2e12x2+2E(x).Ix(E(x2)+2E(x)2)dλ(x) (137)

We can also provide a Fisher metric in dual Lie algebra as hessian of the Entropy:

S(Q)=Q,βΦ(β)=1+log(2π)+log(mM22) (138)
IFisher(Q)=(m+12M2)1[IMTMT12M2m] (139)

and as (m,M)=E(J(x))=E[2(12x2,x)]=[E(x2),2E(x)], Fisher metric in dual space of Lie Algebra parameterization could be written:

IFisher(Q)=(2E(x)2E(x2))1[I(2E(x))T2E(x)2E(x)2+E(x2)] (140)

8. New Entropy Definition as Generalized Casimir Invariant Functions for Coadjoint and Adjoint Representation

In his paper written in 1974, Jean-Marie Souriau has observed that if we consider the heat expression Q=dΦdβ, that we can write δΦQ,δβ=0. For each δβ tangent to the orbit, and so generated by an element Z of the Lie algebra, if we consider the relation Φ(Adg(β))=Φ(β)θ(g1),β and we differentiate it at g=e using the property that Θ˜(X,Y)=dθ(X),Y , X,Yg, we obtain Q,[β,Z]+Θ˜(β,Z)=0. Souriau has stopped by this last equation, the characterization of Group action on Q=Φβ. Souriau has also observed that S[Q(Adg(β))]=S[Adg*(Q)+θ(g)]=S(Q). We propose to characterize more explicitly this invariance, by characterizing Entropy as an invariant Casimir function in coadjoint representation.

From last Souriau equation, if we use the identities β=SQ, adβZ=[β,Z] and Θ˜(β,Z)=Θ(β),Z, then we can deduce that adSQ*Q+Θ(SQ),Z=0,Z. So, Entropy S(Q) should verify adSQ*Q+Θ(SQ)=0, characterizes an invariant Casimir function in case of non-null cohomology, that we propose to write with Poisson brackets, {S,H}Θ˜(Q)=0 where {S,H}Θ˜(Q)=Q,[SQ,HQ]+Θ˜(SQ,HQ)=0, H:g*R,Qg*.

In a Poisson manifold, Casimir functions SC(g*), in case of null cohomology, are functions whose Poisson brackets will all functions vanish, {S,H}(Q)=0 ,SC(g*),Qg*. In the dual of the Lie algebra of a connected Lie group G, the Casimir functions are the Ad*-invariant functions, because if S,HC(g*) and Qg*, then {S,H}(Q)=Q,[SQ,HQ]=Q,adSQHQ=adSQ*Q,HQ vanishes for all HC(g*) if and only if adSQ*Q=0. A function is S on g* is Ad*-invariant if g.S=S, gG where Lie group G acts on functions on g* by (g.S)(Q)=S(Adg*Q), Qg*,SC(g*),gG, and where infinitesimal characterizations of Ad*-invariant functions on g*, ddtS(Adexp(tx)*Q)|t=0=adx*Q,SQ=adSQ*Q,x. The symplectic leaves of a Poisson manifold are contained in the connected components of the level sets of the Casimir functions and Casimir function is constant on a symplectic leaf. Coadjoint orbits lie on level sets of the Casimir functions, which are conserved quantities. Casimir functions Level sets are symplectic manifolds. Coadjoint motion of the moment map Q(t)=Adg(t)*Q(0) for a solution curve g(t)C(G) take place on the intersections of levels sets of the Hamiltonian and the Casimir functions. Alexis Arnaudon has studied stochastic coadjoint processes whose solutions lie on coadjoint orbits.

We have observed that {S,H}Θ˜(Q)=Q,[SQ,HQ]+Θ˜(SQ,HQ)=0,H:g*R,Qg*, that shows that Souriau Entropy is a Casimir function in case with non-null cohomology when an additional cocycle should be taken into account. Indeed, infinitesimal variation is characterized by the following differentiation: ddtS(Q(Adexp(tx)β))|t=0=ddtS(Adexp(tx)*Q+θ(exp(tx)))|t=0=adSQ*Q+Θ(SQ),x. We recover extended Casimir equation in case of non-null cohomology verified by Entropy, adSQ*Q+Θ(SQ)=0, and then the generalized Casimir condition {S,H}Θ˜(Q)=0. Hamiltonian motion on these affine coadjoint orbits is given by the solutions of the Lie-Poisson equations with cocycle.

The identification of Entropy as an Invariant Casimir Function in Coadjoint representation is also important in Information Theory, because classically Entropy is introduced axiomatically. With this new approach, we can build Entropy by constructing the Casimir Function associated to the Lie group and also in case of non-null cohomology. Igor V. Shirokov [71,72,73,74,75] has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. The idea of the method of constructing coadjoint invariants is to construct the canonical transition to the Darboux coordinates on the orbits of the dual Lie algebra g* of maximal dimension dual to the Lie algebra g of the Lie group G. These relations provide invariants of the coadjoint representation of the Lie group G.

This geometric framework unifies several earlier works on the subject, including Souriau’s symplectic model of statistical mechanics, and approaches developed in Information Geometry and Quantum Information Geometry. This approach helps to identify the common geometric structures appearing in various domains from statistical mechanics to statistical learning. The emphasis is put on the role of the affine equivariance with respect to Lie group actions, as extension of the Fisher metric in presence of equivariance and the associated Lie-Poisson equations with cocycle (affine Lie-Poisson equations). The entropy of the Souriau model as a Casimir function can be used to apply a geometric model for energy preserving entropy production on Lie algebras. We can exploit the geometric framework of this new equation to build geometric numerical integrator schemes for some of the equations associated to Souriau’s model and its polysymplectic extension. This new equation is important because it introduce new structure of differential equations in case of non-null cohomology and for an arbitrary Hamiltonian H:g*R: dQdt=adHQ*Q+Θ(HQ).

The equation dQdt=adHQ*Q+Θ(HQ) is important because it allows extending stochastic perturbation of the Lie-Poisson equation with cocycle within the setting of stochastic Hamiltonian dynamics, which preserves the affine coadjoint orbits. We can extend model for stochastic geometric modeling in fluid dynamics via variational principles described in [32,76]. This extension results in the new Stratonovich differential equation for the stochastic process dQ+[adHQ*Q+Θ(HQ)]dt+i=1N[adHiQ*Q+Θ(HiQ)]dWi(t)=0.

This new equation is also very usefull for geometric symplectic Lie group integrator for Lie-Poisson equations with cocycle that preserves the affine coadjoint orbits for general Hamiltonian. This equation is also very relevant in the framework of dynamics with Casimir dissipation/production, to formulate a dynamical geometric model for dissipation/production of this Casimir. This allows to extend the general Lie algebraic approach developed in [77,78] for Casimir dissipation, to take into account of a cocycle, and to a wider class of dissipation. Paper [17] will exploit this new Casimir equation in case of non-null cohomology.

This equation dQdt=adHQ*Q+Θ(HQ) could be used also to make the link with 2nd principle of Thermodynamique, that will be deduced from positivity of Souriau-Fisher metric:

S(Q)=Q,βΦ(β) with dQdt=adHQ*Q+Θ(HQ)dSdt=Q,dβdt+adHQ*Q+Θ(HQ),βdΦdt=Q,dβdt+adHQ*Q,β++Θ(HQ),βdΦdtdSdt=Q,dβdt+Q,[HQ,β]+Θ˜(HQ,β)dΦdt=Q,dβdt+Θ˜β(HQ,β)Φβ,dβdtdSdt=Q,dβdt+Θ˜β(HQ,β)Φβ,dβdt with Φβ=QdSdt=Θ˜β(HQ,β)0,H (link to positivity of Fisher metric)if H=SSQ=QdSdt=Θ˜β(β,β)=0 because βKerΘ˜β (141)

Entropy production is then linked with Souriau-Fisher structure, dS=Θ˜β(HQ,β)dt with Θ˜β(HQ,β)=Θ˜(HQ,β)+Q,[HQ,β]  Souriau tensor related to Fisher metric.

8.1. Casimir Invariant and Generalized Casimir Invariant

Hendrik Brugt Gerhard Casimir, a Dutch physicist, studied what is called Casimir operators and Casimir invariants (H. Casimir and Van der Waerden studied the SU(2) group, the group of isospin/angular momentum, as the model of the algebraic approach to the study of the unitary representations of semi-simple compact Lie groups). Kirillov has explained that Casimir operators are in one-to-one correspondence with polynomial invariants characterizing orbits of the coadjoint representation. Solutions are not necessarily polynomials and the nonpolynomial solutions are called generalized Casimir invariants. For certain classes of Lie algebras, all invariants of the coadjoint representation are functions of polynomial ones. In physics, Hamiltonians and integrals of motion of classical integrable Hamiltonian systems are not polynomials in the momenta [71,72,73,74,75,79,80,81,82,83,84,85,86,87,88,89,90,91,92].

8.2. Souriau Entropy as Generalized Casimir Invariant in Coadjoint Representation

In Souriau Lie groups Thermodynamics, we will see that coadjoint orbits lie on level sets of the Entropy that could be considered as a Casimir invariant function:

S:g*RQS(Q) (142)

We will consider first the case of null-cohomology, Entropy as Casimir invariant function is a conserved quantity, because Casimir function has null Lie Poisson brackets functions [93,94]:

{S,H}(Q)=Q,[SQ,HQ]=0,H:g*R,Qg* , A,B=B(A,B) Cartan-Killing formwith S(Q)=ddεS(Q+δQ)|ε=0=δQ,SQ (143)

We can observe that β=SQ, then:

Q,[β,HQ]=Q,adβHQ=0,H:g*R,Qg* , adab=[a,b] (144)

We can also write:

Q,[SQ,HQ]=Q,adSQHQ=adSQ*Q,HQ=0 , H:g*R (145)

It means that adSQ*Q=adβ*Q=0 , β=SQ. We can remark that if we note (adSQ*Q)j=Cijkad(SQ)i*Qk=0 with Cijk the structure tensor, we observe that this equation is in fact the Casimir condition for invariant function in coadjoint representation as we will see hereafter. The restriction of the Lie-Poisson bracket to an orbit generates a symplectic structure on the orbit, called the KKS (Kirillov-Kostant-Souriau) structure, or the canonical symplectic structure. Casimir function is characterized as a quantity which commutes with each linear functional on the Poisson manifold, and then it is conserved by dynamics of any Hamiltonian.

Given a Hamiltonian H:g*R, the equation of motion for Qg* is:

dQdt={Q,H}=adHQ*Q with H=SdQdt={Q,S}=adSQ*Q=0 (146)

In case of non-null cohomology, the Lie Poisson brackets functions are given by:

{S,H}Θ˜(Q)=Q,[SQ,HQ]+Θ˜(SQ,HQ)=0,H:g*R,Qg*with Θ˜(X,Y)=J[X,Y]{JX,JY} where JX(x)=J(x),XΘ˜(X,Y):g×gwith Θ(X)=Teθ(X(e))X,YΘ(X),Y (147)

That we can develop in the following:

{S,H}Θ˜(Q)=Q,[SQ,HQ]+Θ(SQ),HQ=0{S,H}Θ˜(Q)=Q,adSQHQ+Θ(SQ),HQ=0{S,H}Θ˜(Q)=adSQ*Q,HQ+Θ(SQ),HQ=0H,{S,H}Θ˜(Q)=adSQ*Q+Θ(SQ)+,HQ=0adSQ*Q+Θ(SQ)=0 (148)

We have found the generalized Casimir equation for Entropy in the non-null cohomology case:

{S,H}Θ˜(Q)=0 (149)

That could be also written:

adSQ*Q+Θ(SQ)=0 (150)

This equation was observed by Souriau in his paper of 1974, where he has written that geometric temperature β is a kernel of Θ˜β, that is written:

βKerΘ˜βQ,[β,Z]+Θ˜(β,Z)=0 (151)

That we can develop to recover the Casimir equation:

Q,adβZ+Θ˜(β,Z)=0adβ*Q,Z+Θ˜(β,Z)=0β=SQadSQ*Q,Z+Θ˜(SQ,Z)=adSQ*Q+Θ(SQ),Z=0,ZadSQ*Q+Θ(SQ)=0 (152)

Then the generalized Casimir Equation in non-null cohomogy is given by:

(adSQ*Q)j+Θ(SQ)j=Cijkad(SQ)i*Qk+Θj=0 (153)

Given a Hamiltonian H:g*R, the equation of motion for Qg* is:

dQdt=adHQ*Q+Θ(HQ) with H=SdQdt=adSQ*Q+Θ(SQ)=0 (154)

Level sets of the Casimir Entropy function, on which the coadjoint orbits lie, are symplectic manifolds.

8.3. Souriau Entropy Invariance in Coadjoint Representation

If we note 𝕬𝖓(g*) the space of analytic function on the dual space of the Lie agebra g*, a function F*𝕬𝖓(g*) is a Casimir invariant if for any gG,Xg*, we have F*(Adg*X)=F*(X). We have observed previously that Souriau’s Entropy analytic function S(Q) defined on dual space of the Lie algebra g* by Legendre transform of Massieu Characteric analytic function Φ(β) (minus logarithm of Laplace transform) defined on Lie algebra g was an invariant function under the affine coadjoint action S[Q(Adg(β))]=S[Adg*(Q)+θ(g)]=S(Q). In case of null-cohomology, Souriau cocycle cancels θ(g)=0, and we recover Casimir invariant function in coadjoint representation S[Adg*(Q)]=S(Q).

We can then observe that Souriau Entropy is an extended Casimir invariant function in case of non-null cohomogy. This characteristic of Souriau Entropy could be a new characterization of Entropy. In Souriau Lie groups Thermodynamics, Entropy S(Q) is a generalized Casimir invariant function for coadjoint representation in case of non-null cohomology, and Massieu Characteristic function by Legendre duality is a generalized Casimir function for adjoint representation.

We will explain how to prove that Souriau Entropy is invariant under the action of the group, starting from its definition:

S(Q)=Q,βΦ(β) with Q=Φ(β)βg* and β=S(Q)Qg (155)

with

Φ(β)=logMeU(ξ),βdλω and U:Mg* (156)

Considering Souriau Entropy S(Q) where the heat Q=Φ(β)βg* an element of the dual space of the Lie algebra is parameterized by βg an element of the Lie algebra, the Lie group G acts through gG by adjoint operator Adg, the entropy is given by S[Q(Adg(β))] with Q(Adg(β)) given by fundamental Souriau equation:

Q(Adg(β))=Adg*(Q)+θ(g) (157)

The invariance of Souriau Entropy is deduced from the following developments:

βgAdg(β)Ψ(Adg(β))=MeU,Adg(β)dλωΨ(Adg(β))=MeAdg1*U,βdλω=MeU(Adg1β)θ(g1),βdλωΨ(Adg(β))=eθ(g1),βΨ(β)θ(g1)=Adg1*θ(g)Ψ(Adg(β))=eAdg1*θ(g),βΨ(β)Φ(β)=logΨ(β)Φ(Adg(β))=Φ(β)θ(g1),β=Φ(β)+Adg1*θ(g),β (158)

Based on this expression of Massieu Characteristic function transform by action of the group, we can use Legendre transform to study how Souriau Entropy is changed:

S(Q)=Q,βΦ(β)S(Q(Adgβ))=Q(Adgβ),AdgβΦ(Adgβ){Q(Adg(β))=Adg*(Q)+θ(g)Φ(Adg(β))=logΨ(Adg(β))=θ(g1),β+Φ(β)S(Q(Adgβ))=Adg*(Q)+θ(g),Adgβ+θ(g1),βΦ(β)S(Q(Adgβ))=Adg*(Q)+θ(g),AdgβAdg1*θ(g),βΦ(β)S(Q(Adgβ))=Adg1*Adg*(Q)+Adg1*θ(g),βAdg1*θ(g),βΦ(β)Adg1*Adg*(Q)=QS(Q(Adgβ))=Q,βΦ(β)=S(β) (159)

We finally prove that Souriau Entropy is invariant in coadjoint representation S(Adg*(Q)+θ(g))=S(β) in general case of non-null cohomology, that we could write S(Adg#(Q))=S(β), if we note affine coadjoint action Adg#(Q)=Adg*(Q)+θ(g). This is also true in case of null-cohomology when the Souriau cocycle cancels θ(g)=0, and we recover classical generalized Casimir invariant function definition on coadjoint representation for Entropy S(Adg*(Q))=S(β) generalized Casimir invariant function definition on adjoint representation for Massieu Characteristic function Φ(Adg(β))=Φ(β).

8.4. Souriau Entropy Given by Casimir Invariant Functions Equations

Based on development given in the following we can state that:

As the Entropy S is a generalized Casimir invariant function in the coadjoint representation, S(Adetξ*h)=S(h), then S should be solution of the following differential equation:

CijkQkS(Q)Qj=0 , i,j,k=dimg, with {CijkQk=Cij(Q)=BijBQ(x,y)=Bijxiyj=Q,[x,y] (160)

where Cijk is the structure tensor of the Lie algebra g in the basis (e1,e2,,en), while Xk are the coordinates in g* in the basis (e1,e2,,en) defined by ej,ei=δij. The structure tensor s given by [ϕ(ei),ϕ(ei)]=Cijkϕ(ek) with ϕ(ei)=CijkXkXj , i=1,,n.

8.5. Characterization of Generalized Casimir Invariant Functions in Coadjoint Representation

We will describe recent characterization of generalized Casimir invariant functions by Oleg L. Kurnyavko and Igor V. Shirokov [72,73,75] who have proposed Algebraic method for construction of Casimir invariants of Lie groups coadjoint representations (see Appendix C). Modern invariant theory based on geometric methods, which was credited classically as non-constructive, has some exception admitting a constructive solution related to the constructing invariants of Lie groups representations.

Let T be a connected Lie group, T(G) a representation of the group G in the linear space V, Tg the operators associated to the representation of the group G on the linear space V, then the invariants are given by the following equation:

F(Tgx)=F(x) , xV,gG,TgT(G),F(x)C(V) (161)

With the properties that:

Te=I , Tgagb=TgaTgb , Tg1=(Tg)1 (162)

Solution is given by the following differential equation:

i,jdimVtkjixjF(x)xi=0 with tkji=(Tg)jigk|g=e and k=1,,dimG (163)

tkji are elements of the matrices of the Lie algebra representation basis of G.

That we can write tk=tkjixjxi and tkF(x)=0.

If we consider the dual space V*, the co-tangent representation is given by:

T*(g)X,T(g)x=X,x (164)

And co-represnetation invariants are given by:

tk*F*(X)=0 with tk*=tkjiXiXj (165)

They have underlined the relationship between invariants of representations and conjugate representations, where the algebraic construction of Lie groups representations invariants are given by invariants of the conjugate representation with respect to the invariants of the original representation.

Shirokov Theorem 1. 

Let F(x) be a non-degenerate invariant of the representation T(G) , then conjugate representation invariant can be found by Legrendre tranform:

F*(X)=xiXiF(x)=x,XF(x) with X=F(x)x such that Xi=F(x)xi (166)

and also the converse problem:

F(x)=xiXiF*(X)=x,XF*(X) with x=F*(X)X such that xi=F*(X)Xi (167)

Shirokov has considered F(x) the representation invariant T(G), and F*(X) the representation invariant T*(G) conjugate to T(G), with the conditions:

tkjixjF(x)xi=0 and tljiXiF*(X)Xj=0 (168)
tljiXiF*(X)Xj=tljiXiXj[xk(X)XkF(x(X))]=tljiXixkXjXk+tljiXixkXkXjtljiXiF(x)xkxkXjtljiXiF*(X)Xj=tljiXixkXjF(x)xk+tljiF(x)xixkδkjtljiF(x)xiF(x)xkxkXjtljiXiF*(X)Xj=tljixjF(x)xi=0 (169)

Invariant Casimir Functions of the coadjoint representation has been studied for completely integrable Hamiltonian systems, as classical systems on the orbits of the coadjoint representation. Oleg L. Kurnyavko and Igor V. Shirokov have considered the relationship between invariants of representations of Lie groups and their conjugate dual representations.

Considering the coadjoint action given by:

Adg*X,x=X,Adg1x , gG,Xg*,xg (170)

Invariants of a coadjoint representation are called Casimir functions, with the property:

F*(Adg*X)=F*(X) (171)

the infinitesimal invariance is given by the equations:

Cij(X)F*(X)Xj=0 with Cij(X)=CijkXk, i,j,k=dimg (172)

The number of functionally independent invariants is given by the rank of the matrix Cij(X), called the index of the Lie algebra g: indg=dimg*supXg*rankCij(X).

From these adjoint and coadjoint representation, Shirokov has introduced the following theorem:

Shirokov Theorem 2. 

LetF(Adgx)=F(x)be a non-degenerate invariant of the adjoint representationAdG, then conjugate representation invariant, invariant of coadjoint representationAdG*can be found by formula:

F*(X)=xiXiF(x)=x,XF(x) with X=F(x)x such that Xi=F(x)xi (173)

and also the converse problem, letF*(Adg*X)=F*(X), invariant of coadjoint representationAdG is given by:

F(x)=xiXiF*(X)=x,XF*(X) with x=F*(X)X such that xi=F*(X)Xi (174)

Nota:

CijkXkF*(X)Xj=0 , i,j,k=dimg, with {CijkXk=Cij(X)=BijBX(x,y)=Bijxiyj=X,[x,y] (175)

8.6. Constructing Generalized Casimir Invariant Functions in Coadjoint Representation

I. V. Shirokov has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. Oleg L. Kurnyavko and Igor V. Shirokov have also proposed a general method for constructing Casimir invariants.

We will give some other developments of Casimir Invariant Functions by A.T. Fomenko and V.V. Trofimov, related to Orbits of the coadjoint representation and the associated canonical symplectic structure.

The coadjoint orbit Oh passing through the point hg* is given by

Oh={Adg*h/gG} where hg* (176)
ThOh={adρ*h/ρg}g*,hg*v=d[Adexptρ*h]dt|t=0ThOh , ρg (177)
Let (e1,,en) basis of g, (e1,,en) basis of g*, with ei,ej=δi,jh=hieivi=d[hi,Adexptρ*h]dt|t=0=d[Adexptρ*h,ei]dt|t=0vi=d[h,Adexptρei]dt|t=0=h,d[Adexptρei]dt|t=0vi=h,[ρ,ei]=adρ*h,ei=v,eiv=adρ*h (178)

Kirillov, Kostant and Souriau have introduced a KKS 2-form on co-adjoint co-orbits that then inherit a structure of homogeneous symplectic manifold:

ξ,ηThOh={adχ*h/ρg}g*,hg*ωh(ξ,η)=ω(adξ1*h,adη1*h)=h,[ξ1,η1] with ξ=adξ1*h and η=adη1*h (179)

This KKS 2-form ω is invariant with respect to the coadjoint action ωg(Adf*ξ,Adf*η)=ωh(ξ,η):

ωg(Adf*ξ,Adf*η)=ωg(Adf*adξ1*h,Adf*adη1*h)with g=Adf*h , ξ=adξ1*h , η=adη1*h and fG,g,hg*Adf*adξ1*h=adAdfξ1*(Adf*h) and Adf*adη1*h=adAdfη1*(Adf*h)ωg(Adf*ξ,Adf*η)=ωg(adAdfξ1*(Adf*h),adAdfη1*(Adf*h))ωg(Adf*ξ,Adf*η)=ωg(adAdfξ1*g,adAdfη1*g)=g,[Adfξ1,Adfη1]ωg(Adf*ξ,Adf*η)=g,Adf[ξ1,η1]=Adf1*g,[ξ1,η1] with h=Adf1*gωg(Adf*ξ,Adf*η)=h,[ξ1,η1]ωg(Adf*ξ,Adf*η)=ωh(ξ,η) (180)

The symplectic structure is given due to the property that dω=0, that could be proved making link with Jacobi identity.

Let gradskewm such that ω(v,gradskewm)=v(m)=ivimxi smooth vector field on M{m,n}=ω(gradskewm,gradskewn)=i<jωij(gradskewm)i(gradskewn)jwith (gradskewm)i=jωijmxj{m,n}=i<jωijmxinxj (181)

Jacobi identity can be computed:

{m,{n,p}}=(gradSkewm){n,p}=LgradSkewm{n,p} with Lζ: Lie derivativeIf ξ=gradSkewmLξ{n,p}=Lξ(ωijnxipxj)=Lξ(ω)ijnxipxj+ωij(ξn)xipxj+ωijnxi(ξp)xjLξ{n,p}=Lξ(ω)ijnxipxj+{ξn,p}+{n,ξp}=Lξ(ω)ijnxipxj{{m,n},p}{n,{m,p}}{m,{n,p}}+{{m,n},p}+{n,{m,p}}=Lξ(ω)ijnxipxj (182)

Using Elie Cartan formula Lξω=i(ξ)dω+di(ξ)ω. If ξ is a Hamiltonian vector field, di(ξ)ω=0 and then Lξω=i(ξ)dω. If dω=0, then the Jacobi identity is satisfied {m,{n,p}}+{{m,n},p}+{n,{m,p}}=0 and conversely.

Let consider the Berezin Bracket:

{m,n}=Cijkxkmxinxj with [ei,ej]=Cijkekwhere (e1,e2,,en) basis of Lie algebra g,(e1,e2,,en)basis of dual Lie algebra g*of corresponding coordinates x1,,xn for g, x1,,xn for g* (183)

This Berezin Bracket is given by:

{m,n}x=dmx(addn(x)*(x))=(addn(x)*(x))(dmx){m,n}x=x,[dnx,dmx]=Cijkxknximxj with dnx=nxiei , dmx=mxjej (184)

By developping Berezin Bracket {m,n}=Cijkxkmxinxj with [ei,ej]=Cijkek, we can prove that the bracket verify jacoby identy {m,{n,p}}+{{m,n},p}+{n,{m,p}}=0 and then dω=0.

We will see that differential equation for (semi-)invariants of the coadjoint representations could be established. We will note 𝕬𝖓(g*) the space of analytic function on the dual space of the Lie agebra g*. A function F*𝕬𝖓(g*) is an invariant if for any gG,Xg*, we have F*(Adg*X)=F*(X), and is semi-invariant if F*(Adg*X)=χ(g)F*(X) where χ(g) is a character of the Lie group G.

We have a representation of Lie algebras ϕ:gVec(Γ) defined on basis (e1,e2,,en) in g where Vec(Γ) is the space of vector fields on Γ an open subset in g*, given by:

ϕ(ei)=CijkXkXj , i=1,,n (185)

where Cijk is the structure tensor of the Lie algebra g in the basis (e1,e2,,en), while Xk are the coordinates in g* in the basis (e1,e2,,en) defined by ej,ei=δij. The representation is not dependent of the choice of the basis, with the property: [ϕ(ei),ϕ(ei)]=Cijkϕ(ek).

We have the property, that:

dnF*(Adetξ*h)dtn|t=0=[(ϕ(ξ))nF*](h) (186)

This result is obtained by the following development:

dF*(Adetξ*h)dt|t=0=F*Xi(h).dAdetξ*h,Xidt|t=0=F*Xi(h).dh,Adetξeidt|t=0dh,Adetξeidt|t=0=h,[ξ,ei]=h,Cijkξjek with [ξ,ei]=[ξjej,ei]=Cjikξjekdh,Adetξeidt|t=0=hkek,Cjikξjek=CjikξjhkdF*(Adetξ*h)dt|t=0=CjikξjhkF*Xi(ξ)=(ϕ(ξ)F*)(h) (187)

We use then Taylor expansion of F*(Adetξ*h) given by:

F*(Adetξ*h)=F*(h)+n=1(ϕ(ξ))nF*n!(h).tn (188)

We can observe that F* is invariant if F*(Adetξ*h)=F*(h) and then (ϕ(ξ))nF*=0 or ϕ(ξ)F*=0 that could be written CjikξjhkF*Xi(ξ)=0.

If F* is semi-invariant of the coadjoint representation of group if and only if:

ϕ(ei)F*=λiF* with λi=dχ(ei) (dχ : derivative of χ at the group G identity element
F*(Adetξ*h)=χ(etξ)F*(h) with χ(etξ)=etχ*(ξ)[(ϕ(ξ))nF*](h)=χ*(ξ)F*(h)F*(Adetξ*h)=[1+n=1[χ*(ξ)]nn!tn].F*(h) (189)

9. Conclusion: Lie Groups Thermodynamics for Machine Learning

With Lie groups Thermodynamics, we have presented Souriau tools to extend Gibbs density for Lie groups [95,96,97,98,99,100,101,102,103,104,105,106,107]. We can make reference to other explorations of Lie Group Representation theory to built exponential families [108,109,110,111] or Information Geometry in Quantum Physics [112,113,114,115,116,117,118,119,120,121,122,123]. Gibbs density estimation is a basic tool in statistical macine learning. Classically, we can associate to any posterior distribution an effective generalized geometric temperature, given by an element of the dual space of the Lie algebra, relating it to the Gibbs prior distribution. Classification rules could be introduced by Gibbs measures defined on parameter sets and depending on the observed sample value. A Gibbs measure is a special kind of probability measure used in statistical mechanics to describe the state of a particle system driven by a given energy function at some given temperature. Gibbs measures will be realized as minimizers of the average loss value under entropy constraints. In this extension for Lie groups, an important tool is the log-Laplace transform related to the Massieu Characteristic Function in Thermodynamics (a re-parameterization of the free energy by Planck temperature preserving Legendre transform with respect to Entropy). As we want to deal with Lie group data for Machine Learning, we will consider tools very similar to those used in statistical mechanics to describe particle systems with many degrees of freedom. Classification rules could be described by Gibbs measures defined on parameter sets and depending on the observed sample value. Comparing any posterior distribution with a Gibbs prior distribution make it possible to provide a way to build an estimator which can be proved to reach adaptively at the best possible asymptotic error rate (by temperature selection of a Gibbs posterior distribution built within a single parametric model). Estimators derived from Gibbs posteriors show excellent performance in diverse tasks, such as classification, regression and ranking. The usual recommendation is to sample from a Gibbs posterior using MCMC (Markov chain Monte Carlo). With covariant Souriau Gibbs density, it is possible to extend MCMC and Gibbs sampler approach for Lie Groups Machine Learning.

More recently, the use of perturbation techniques has been proposed as an alternative to MCMC techniques for sampling. These results have been extended in conditional random fields loss, proving that the maximum in expectation with low-rank perturbations, provides an upperbound on the log partition (what we call Massieu characteristic function). New lower bounds on the partition function and new unbiased sequential sampler for the Gibbs distribution based on low-rank perturbations have been introduced. All these methods are based on sampling from the Gibbs distribution, upper-bounding the log partition function. All these results are synthetized in [124], where they also propose a new general method, with connections to the recently-proposed Fenchel-Young losses [125], using doubly stochastic scheme for minimization of these losses, for unsupervised and supervised learning. This is a generalization to the Gibbs distribution. Methods for learning parameters of a Gibbs distribution on data (yi)i=1,,n are based on maximization of the likelihood:

l^n(θ)=1ni=1nlogpGibbs,θ(yi)=1ni=1nyi,θlogψ(θ) (190)

that is optimized by gradients methods using the empirical log-likelihood, given by:

θl^n(θ)=y^nEGibbs,θ[y] (191)

For this method of moment-matching, the expectation of the Gibbs distribution is a challenge in some cases. This approach has been replaced by computing pθ, with a method called “perturb-and-MAP” to learn the parameters in this model as a proxy for log-likelihood. This minimization is equivalent to maximizing previous equation by substituting the log-partition logψ(θ) with:

Fε(θ)=E[F(θ+εV)]=E[maxy,θ+εVyC] with a random noise vectorεV,ε>0 (192)

This approach could be linked with the use of Fenchel-Young losses [125]. In the perturbed model, the Fenchel-Young loss is given by:

Lε(θ;y)=Fε(θ)+εΩ(y)θ,y=DεΩ(y,y^ε*(θ)) (193)

with loss gradient Lε(θ;y)=Fε(θ)y=yε*(θ)y where yε*(θ)=Epθ(y)[y]=E[argmaxyCy,θ+εV] and DεΩ(y,y^ε*(θ)) Bregman divergence associated to εΩ. As Fε generalizes the log-sum-exp function on the simplex, its dual Ω is a generalization of the negative entropy (which is the Fenchel dual of log-sum-exp).These connections have been studied in [126].

To conclude, we have seen that Lie group tools based on Representation Theory and Orbits Methods could be used with Souriau-Fisher Metric on Coadjoint Orbits as an extension of Fisher Metric for Lie group through homogeneous Symplectic Manifolds on Lie group Co-Adjoint Orbits.

We can then beneficiate of different tools based on Souriau Lie groups Thermodynamics and Kirillov Representation Theory, as illustrated in Figure 6, for:

Figure 6.

Figure 6

Supervised/Non-Supervised Machine Learning based on Lie Groups Thermodynamics.

  • Supervised Machine Learning

  • Geodesic Natural Gradient on Lie Algebra: Extension of Neural Network Natural Gradient from Information Geometry on Lie Algebra for Lie Groups Machine Learning.

  • Souriau Maximum Entropy Density on Co-Adjoint Orbits: Covariant Maximum Entropy Probability Density for Lie groups defined with Souriau Moment Map, Co-Adjoint Orbits Method & Kirillov Representation Theory

  • Symplectic Integrator preserving Moment Map: Extension of Neural Network Natural Gradient to Geometric Integrators as Symplectic integrators that preserve moment map

  • Non-Supervised Machine Learning

  • Souriau Exponential Map on Lie Algebra: Exponential Map for Geodesic Natural Gradient on Lie Algebra based on Souriau Algorithm for Matrix Characteristic Polynomial

  • Fréchet Geodesic Barycenter by Hermann Karcher Flow: Extension of Mean/Median on Lie group by Fréchet Definition of Geodesic Barycenter on Souriau-Fisher Metric Space, solved by Karcher Flow.

  • Mean-Shift on Lie groups with Souriau-Fisher Distance: Extension of Mean-Shift for Homogeneous Symplectic Manifold and Souriau-Fisher Metric Space.

[There is nothing more in physical theories than symmetry groups except the mathematical construction which allows precisely to show that there is nothing more]    « Il n’y a rien de plus dans les théories physiques que les groupes de symétrie si ce n’est la construction mathématique qui permet précisément de montrer qu’il n’y a rien de plus ».

Jean-Marie Souriau (see Figure 7)

Figure 7.

Figure 7

Jean-Marie Souriau, student at ENS Paris 1942.

  La notion classique d’ensemble canonique de Gibbs est étendue au cas d’une variété symplectique sur laquelle un groupe de Lie possède une action symplectique (“groupe dynamique”). La définition rigoureuse donnée ici permet d’étendre un certain nombre de propriétés thermodynamiques classiques (la température est ici un élément de l’algèbre de Lie du groupe, la chaleur un élément de son dual), notamment des inégalités de convexité. Dans le cas de groupes non commutatifs, des propriétés particulières apparaissent: la symétrie est spontanément brisée, certaines relations de type cohomologique sont vérifiées dans l’algèbre de Lie du groupe [The classical notion of Gibbs’ canonical ensemble is extended to the case of a symplectic manifold on which a Lie group has a symplectic action (“dynamic group”). The rigorous definition given here makes it possible to extend a certain number of classical thermodynamic properties (the temperature here is an element of the Lie group algebra, heat an element of its dual), notably inequalities of convexity. In the case of non-commutative groups, particular properties appear: the symmetry is spontaneously broken, certain relations of cohomological type are verified in the Lie algebra of the group].

Jean-Marie Souriau, Mécanique Statistique, Groupes de Lie et Cosmologie, colloque CNRS n°237 – Géométrie Symplectique et physique mathématique

Appendix A. Coadjoint orbits and Moment Map for SU(1,1)

We give more details developpements to obtain SU(1,1)/K coadjoint orbit and moment map from [127]. SU(1,1) has been intensitively studied for Coherent states in Quantum Physics [128].

If we consider hyperbolic Group SL(2,R)

SL(2,R)={(mpqn)GL(2,R)/mnpq=1} (A1)

Elements of SL(2,R) could be written with Iwasawa decomposition:

g=kθ.at.nb with kθK,atA,nbN (A2)
K={kθ=(cosθsinθsinθcosθ)/0θ<2π} with eiθ=miqm2+q2A={at=(et00et)/tR} with e2t=m2+q2N={nb=(1b01)/bR} with b=mp+qnm2+q2 (A3)

K is a maximal compact sub-group of GSL.

Appendix A.1. Group of Unit Disk Automorphisms

Consider SU(1,1) sub-group of SL(2,R) given by

SU(1,1)={A=(abb*a*)/det(A)=|a|2|b|2=1, a,bR} (A4)

The following interior automorphism that transforms SL(2,R) to SU(1,1), inducing an isomorphism between them:

SL(2,R)SU(1,1)gCgC1 with C=12(1i1i) (A5)

SU(1,1) acts on the Poincaré unit disk D={zC/|z|<1}:

g1.z=az+bb*+a* with g1=(abb*a*)SU(1,1) and zD (A6)

Valentine Bargmann has parameterized SU(1,1):

{γ=baω=arg(α) mod 2π{|γ|<1a=eiω(1|γ|2)1/2b=eiωγ(1|γ|2)1/2 (A7)

Then SU(1,1)={(γ,ω)/|γ|<1,ω]π,+π]} with Group composition law given by:

(γ,ω).(γ,ω)=(γ,ω){γ=[γ+γe2iω][1+γγ*e2iω]1ω=ω+ω+12ilog([1+γ*γe2iω][1+γγ*e2iω]1) mod 2πg=(γ,ω)g1=(γe2iω,ω) (A8)

SU(1,1) is topological product of unit disk and circle.

Appendix A.2. Universal covering of SL(2,R)

If we consider G={(γ,ω)/|γ|<1,ωR}, the following mapping:

Θ:GSU(1,1)Θ(γ,ω)=(γ,ω mod 2π) with KerΘ={(0,2kπ)/kZ} (A9)

Topological product of unit Disk D dans C and real straight line R is the universal covering of SU(1,1).

A maximal compact subgroup of SU(1,1) is CKC1={(eiθ00eiθ)/θR} and the subgroup for G is Θ1(CKC1)={(0,θ)/θR}.

Pukanszky and Sally have defined irreductible unitary representation of SL˜(2,R), classified in principal serie, discrete serie and complemantary serie.

The Lie algebra g of G and SU(1,1) is given by:

l1=12(1001),l2=12(0110),l0=12(0110)  (A10)

with the commutation relation: [l0,l1]=l2,[l1,l2]=l0,[l2,l0]=l1.

The dual space of the Lie algebra g* of g to g thanks to Killing form:

B(i=1,2,0xili,i=1,2,0yili)=2(x1y1+x2y2x0y0) (A11)

Appendix A.3. Coadjointes Orbits of SU(1,1)

Considering adjoint representation Adg:gg, and coadjointe representation as transpose linear mapping of Adg1, written by Adg*=(Adg1)*, Adg*:g*g*.

Let f=(η,0,0) in base {l1,l2,l0} with η>0 coadjoint orbit of f is:

Oη={gfg1/g=(acbd)SL(2,R)}Oη={η(1+2bc),η(bdac),η(bd+ac)/a,b,c,dR with adbc=1} (A12)

Stabilizer of f is

G1(f)={gG/g.f=f}=A{A} with A={at=(et00et)/tR} (A13)

SL(2,R)=KNA with A=A{A} where N={nb=(1b01)/bR} and K={kθ=(cosθsinθsinθcosθ)/0θ<π} and as SL(2,R)/G1(f) is a bijection with Oη, then Oη is in bijection with KN, diffeomorph to K/{Id}×N. Then all element xOη can be written through this bijection x=kθnb,kθK,nbN. Oη is set of points l=(x1,x2,x0)g* such that: x12+x22x02=η2>0 a one sheet hyperboloid in g*.

We will study discrete sequence.

Appendix A.4. Quantization of Kähler Manifold

Let (M,ω) a Kalherian manifold of dimension 2n (complex manifold with complex structure J) with 2-form g(X,Y)=ω(X,JY), a Riemannian structure on M.

We have seen that SU(1,1) is conjugate of SL2(R) in SL(2,R)

(αββ*α*)SU(1,1),(abcd)SL2(R)(αββ*α*)=i2(1i1i)(abcd)(ii11) (A14)
{α=12[(a+d)+i(bc)]β=12[(ad)i(b+c)]{α+α*=a+dαα*=i(bc)β+β*=adββ*=i(b+c){a=12[(α+α*)+(β+β*)]b=12i[(αα*)(ββ*)]c=12i[(αα*)+(ββ*)]d=12[(α+α*)(β+β*)] (A15)

Let f=(0,0,h12) in base {l1,l2,l0} where l1=12(1001),l2=12(0110),l0=12(0110)  with h>12 (respectively f=(0,0,h+12) with h<12), the coadjoint orbit of f is:

Oh={gfg1/g=(acbd)SL(2,R)}Oh={(|h|12)signh(ac+bd);(|h|12)signh2(d2+c2(a2+b2));(|h|12)signh2(d2+c2+a2+b2)} (A16)

Oh appears as points l=(x1,x2,x0)g* such that:

x12+x22x02=(|h|212)2<0 with {x0>0 if h>12x0<0 if h<12 (A17)

Oh is one of hyperboloid sheets of g*, associated to representation πh of discrete serie of G.

{x1=h(ac+bd)x2=h2(d2a2+c2b2)x0=h2(d2+a2+c2+b2) with h=(|h|12)signh (A18)

Then, we obtain:

{x1=ih(αβα*β*)x2=h(αβ+α*β*)x0=h(αα*+ββ*) (A19)

If we set:

{α=eiθ1|z|2β=zeiθ1|z|2{x1=ih(zz*)1|z|2x1=h(z+z*)1|z|2x0=h(1+|z|2)1|z|2 (A20)

We use the parametrization of Oh by unit disk D:

D={zC/|z|<1}Ohz(x1,x2,x0) (A21)
{x1=2(|h|12)signhIm(z)1|z|2x1=2(|h|12)signhRe(z)1|z|2x0=(|h|12)signh1+|z|21|z|2 (A22)

This parametrisation provides a kahlerian structure for Oh, inherit from D.

We have Oh=G/K˜=SL(2,R)/K, and as SL(2,R) is isomorph to SU(1,1), Oh is identified with SU(1,1)/K0=D. Stabilizer of the origin is K0={(α00α*)/|α|=1}. Then Oh is globally diffeomorph to D for all h.

We can compute Liouville measure on Oh. This measure is ωh=ωh2π where ωh is the canonical 2-form on orbit. Parameterization on Oh gives:

ωh=ωh2π=i(2|h|1)2π(1|z|2)signh|dz|2 (A23)

We can observe that:

{dx1=i(1|z|2)[(zz*)(1zz*)dh+h(1z*2)dzh(1z2)dz*]dx2=1(1|z|2)[(z+z*)(1zz*)dh+h(1+z*2)dzh(1+z2)dz*]dx0=1(1|z|2)[(1+zz*)(1zz*)dh+2z*hdz2zhdz*] (A24)
dx1dx2dx0=ih2(1|z|2)(1|z|2)6det(zz*1|z|2(1z2)z+z*1+z*21+z21+|z|22z*2z) (A25)
dx1dx2dx0=2ih2(1|z|2)dh|dz|2=2πhdhωh (A26)

This the measure defined on open set of g* given by l12+l22l02<0.

Appendix B. Bargman Parameterization of SU(1,1)

SU(1,1) is isomorphic to SL(2,R)=Sp(2,R) through the complex unitary matrix W:

SL(2,R)={g=(abcd)/detg=adbc=1} (A27)
Sp(2,R)={g=(abcd)/gJgT=J,J=(0+110)} (A28)
W=12(ω1ω1ωω)=(W+)1 with ω=eiπ/4=12(1+i) (A29)

If we observe that W1JW=iM, the isomorphism is given explicitely by:

(abcd)=g(u)=WuW1=(Re(α+β)Im(αβ)Im(α+β)Re(αβ)) (A30)
(αββ*α*)=u(g)=W1gW=12((a+d)i(bc)(ad)+i(b+c)(ad)i(b+c)(a+d)+i(bc)) (A31)

We can also make also a link with SO(2,1) of “1 + 2” pseudo-orthogonal matrices:

SO(2,1)={ΓGL(3,3)/det(Γ)=1,ΓKΓT=Γ,K=(+100010001)} (A32)
Γ(g)=(12(a2+b2+c2+d2)12(a2b2+c2d2)cdab12(a2+b2c2d2)12(a2b2c2+d2)cdabbdacbdacad+bc) (A33)

with Γ(g1)Γ(g2)=Γ(g1g2),Γ(I)=I,Γ(g1)=Γ(g)1

The SO(2,1) matrix corresponds to any SU(1,1):

Γ(u)=(|α|2+|β|22Reαβ*2Imαβ*2ReαβRe(α2+β2)Im(α2β2)2ImαβIm(α2+β2)Re(α2β2)) (A34)

and α=±12(Γ11+Γ12)+i(Γ12Γ21),β=12α(Γ10iΓ20)

The properties of connectivity of Sp(2,R) is described by its isomorphy with SU(1,1). Using unimodular condition:

|α|2|β|2=1αR2+αI2βR2=1+βI21 with α=αR+iαI and β=βR+iβI

If βI is fixed, (αR,αI,βR) are constrained to define a one-sheeted revolution hyperboloid, with its circular waist in the α plane.

To SU(1,1), we can associate the simply-connected universal covering group, using the maximal compact subgroup U(1) and corresponding to the Iwasawa decomposition (factorization of a noncompact semisimple group into its maximal compact subgroup times a solvable subgroup).

(αββ*α*)=(eiω00eiω)(λμμ*λ) with {ω=argα=12iln(α*α1)λ=|α|>0μ=eiωβ=α*αβ (A35)
β=eiωμ,|α|2|β|2=λ2|μ|2=1 so |μ|<λ (A36)

Bargmann has generalized this parameterization for Sp(2N,R), more convenient but difficult to generalize to N dimensions. For SU(1,1):, Bargmann has used (ω,γ):

γ=μλ=βα (|γ|<1) , λ=11|γ|2,μ=γ1|γ|2 (A37)

For SL(2,R)=Sp(2,R), the Bargman, parameterization is given by this decomposition of a non-singular matrix into the product of an orthogonal and a positive definite symmetric matrix:

(abcd)=(cosωsinωsinωcosω)(λ+ReμImμImμλReμ) (A38)

Conversely: ω=arg[(a+d)i(bc)],μ=eiω[(ad)+i(b+c)]

ω is counted modulo 2π, ωω(mod2π).

SU(1,1) and SL(2,R)=Sp(2,R) are described when ω is counted modulo 2π, ωω(mod2π).

Valentine Bargmann has proposed the covering of the general symplectic group Sp(2N,R):

Sp(2N,R)={g=(ABCD)/gJ2NgT=J2N,J2NT=J2N,J2N=(0ININ0)} (A39)

with relations:

ABT=BAT,ACT=CAT,BDT=DBT,CDT=DCT,ADTBCT=IN (A40)
gSp(2N,R)g1=M2NgTM2N=(DTBTCTAT) (A41)

Bargmann has observed that although Sp(2N,R) is not isomorphic to any pseudo-unitary group, its inclusion in U(N,N) will display the connectivity properties through its unitary U(N) maximal compact subgroup, generalizing the role of U(1)=SO(2) in Sp(2,R).

WN=WIN a 2N×2N matrix where W=W1=12(ωπ/41ωπ/41ωπ/4ωπ/4) with ω=eiπ/4=12(1+i), which gives the N×N block coefficients.

u(g)=WN1gWN=12([A+D]i[BC][AD]+i[B+C][AD]i[B+C][A+D]+i[BC])=(αββ*α*) (A42)

with

{αα+ββ+=IN,α+αβTβ*=INαβTβαT=0,αTβ*β+α=0 (A43)

and

u1=M2Nu+M2N1=(α+βTβ+αT) (A44)

The symplecticity property of g becomes:

uM2Nu+=M2N,M2N=iWN1J2NWN=(IN00IN) (A45)
(ABCD)=g(u)=WNuWN1=(Re(α+β)Im(αβ)Im(α+β)Re(αβ)) (A46)

Valentine Bargmann has extended the well-know theorem that any real matrix R may be decomposed into the product of an orthogonal Q and a symmetric positive definite matrix S, uniquely as R=QS. Bargmann has shown that if RSp(2N,R), then R=QS with Q,SSp(2N,R) where Q maps onto unitary matrix and S maps onto Hermitian positive definite matrix:

u(Q)=(α00α*),αα+=IN,αU(N) and u(S)=exp(0ξξ*0),ξ=ξT (A47)

We can generalize Bargmann parameterization of SU(1,1) to Sp(2N,R):

u{ω,λ,μ}=(eiωIN00eiωIN)(λμμ*λ*),detλ>0 (A48)

Then the Bargmann parameters are:

ω=1Nargdetα,λ=eiωα,μ=eiωβ,eiNω=detα|detα|,detλ=|detα|>0 (A49)

The Sp(2N,R) matrices in terms of the Bargmann parameters are:

g{ω,λ,μ}=(cosωINsinωINsinωINcosωIN)(Re(λ+μ)Im(λμ)Im(λ+μ)Re(λμ)) (A50)

V. Bargmann has proposed the covering of the general symplectic group Sp(2N,R):

Sp(2N,R)={g=(ABCD)/gJ2NgT=J2N,J2NT=J2N,J2N=(0ININ0)} (A51)
ABT=BAT,ACT=CAT,BDT=DBT,CDT=DCT,ADTBCT=IN (A52)

Bargmann has observed that although Sp(2N,R) is not isomorphic to any pseudo-unitary group, its inclusion in U(N,N) will display the connectivity properties through its unitary U(N) maximal compact subgroup, generalizing the role of U(1)=SO(2) in Sp(2,R):

WN=WIN,2N×2N matrix where W=W1=12(ωπ/41ωπ/41ωπ/4ωπ/4) with ω=eiπ/4=12(1+i).

u(g)=WN1gWN=12([A+D]i[BC][AD]+i[B+C][AD]i[B+C][A+D]+i[BC])=(αββ*α*) (A53)

with

αα+ββ+=IN,α+αβTβ*=IN and αβTβαT=0,αTβ*β+α=0 (A54)

The symplecticity property of g becomes:

uM2Nu+=M2N,M2N=iWN1J2NWN=(IN00IN) (A55)
(ABCD)=g(u)=WNuWN1=(Re(α+β)Im(αβ)Im(α+β)Re(αβ)) (A56)

Appendix C. Shirokov Method to build Casimir function

I. V. Shirokov [71,72,73,74,75] has proposed a method for constructing invariants of the coadjoint representation of Lie groups with an arbitrary dimension and structure based on local symplectic coordinates on the coadjoint orbits. They also have made link with construction of the Lie algebra polarization

Bλ(X,Y) be the skewsymmetric form on g defined by:

Bλ(X,Y)=λ,[X,Y] , X,Yg , λg*with Bλ(X,Y)=BijXiYj , Bij=CijkλkCij(λ) , i,j,k=dimg (A57)
KerBλ={Xg/Bλ(X,g)=0}gλ annihilator of λg* (A58)

The polarization 𝖍𝕻(g) of the the covector λ is given by:

Bλ(𝖍,𝖍)=0 , dim𝖍=12(dimg+dimgλ) (A59)

The polarization of a semisimple Lie algebra is the construction of polarization of the reductive Lie algebra g, where g=CB, where C is the center in g, and B is a semisimple subalgebra, given by:

𝖍𝕻(C)𝕻(B)=C𝕻(B) (A60)

Let g be an arbitrary Lie algebra, R its radical and R={Xg/Bλ(X,R)=0} the orthogonal complement of R in g with respect to the form Bλ(X,Y), then the polarization is given by:

𝕻(g)𝕻(R)𝕻(R)=(R)𝕻(R) (A61)

Oleg L. Kurnyavko and Igor V. Shirokov idea of the method of constructing coadjoint invariants is to construct the canonical transition to the Darboux coordinates on the orbits of the dual Lie algebra g* of maximal dimension dual to the Lie algebra g of the Lie group G.

If we consider a Lie group G, its Lie algebra g and its dua Lie algebra g*, and let each element fg* corresponds to the coordinates (f1,f2,,fn) where n=dimg*. The action of the group G on g* splits the dual Lie algebra into orbits Oλ where λg* is some element of the given orbit. As with Souriau Theorem, these orbits is a symplectic manifold, with Darboux theorem, there is a special coordinates (p,q) on them such that the corresponding symplectic form can be written:

ωλ=dpidqi , i=1,,12dimOλ (A62)

The transition to canonical coordinates is determined by the functions:

fj=fj(q,p,λ) , j=1,2,,dimg* , with fj(0,0,λ)=λj (A63)

Eliminating the variables q and p in these transition functions, we deduce the equation of the orbit:

ψk(f)=ωk , k=1,,ndimOλ  (A64)

We obtain ndimOλ functions such that:

ψk(Adg*f)=ψk(f) (A65)

Each nondegenerate orbit is determined by the set of numbers ω={ω1,ω2,,ωindg}, that is fixed by the choice of the covector λ such that rank Cij(λ) is maximal, and then:

ωk=ψk(λ1)=ψk(λ2) , if λ1,λ2Oλ (A66)

We can then consider λ(j) orbits Oλ with j=(j1,j2,,jindg), with the constraint:

ψk(λ(j))=ωk(j) , detωkj0  (A67)

If we set λi(j)=ailjl and Xki the components of the vectors that form the basis of the annihilator of the covector λ(j) such that:

detωk(j)jk=detψkfifi(j)jk|f=λ(j)=detXkifi(j)jk=detXki(λ(j))aik0 (A68)

Finally, we obtain the parametrization, considering with Darboux coordiantes:

fi=fi(q,p,j) , i=1,,indg* (A69)

By eliminating the Darboux coordinate (q,p) in previous equation, we have new relations:

φk(f,j)=0 and jk=jk(f) , k=1,,indg (A70)

These relations provide invariants of the coadjoint representation of the Lie group G.

Igor V. Shirokov has proved that the desired linear transition to the Darboux coordinates on a nondegenerate orbit Oλ is defined by

fi(p,q)=ξia(q)pa+ξiA(0,q)λA where g={lA,la} with lA polarization basis of g (A71)

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Duhem P. L’intégrale des forces vives en thermodynamique. JMPA. 1898;4:5–19. [Google Scholar]
  • 2.Duhem P. Procès-verbaux des Séances de la Société des Sciences Physiques et Naturelles de Bordeaux. PVSScPhNB (1897-98); Bordeaux, France: 1898. Sur l’équation des forces vives en thermodynamique et les relations de la thermodynamique avec la mécanique classique 23 December 1897; pp. 23–27. [Google Scholar]
  • 3.Duhem P. Sur deux Inégalites Fondamentales de la Thermodynamique, CR 156. Gauthier-Villars; Paris, France: 1913. pp. 421–425. [Google Scholar]
  • 4.Duhem P. Traité d’Énergetique ou Thermodynamique Générale. Tome Conservation de l’Énergie. Mécanique Rationelle. Statique Générale. Déplacement de l’Équilibre—Tome II. Dynamique Générale. Conductibilité de la Chaleur. Stabilité de l’Équilibre. GauthierVillars; Paris, France: 1911. pp. 504–528. [Google Scholar]
  • 5.Ollivier Y., Maceau-Caron G. Practical Riemannian Neural Networks. arxiv. 2016abs/1602.08007 [Google Scholar]
  • 6.Ollivier Y. Riemannian metrics for neural networks I: Feedforward networks. Inf. Inference J. IMA. 2015;4:108–153. doi: 10.1093/imaiai/iav006. [DOI] [Google Scholar]
  • 7.Marceau-Caron G., Ollivier Y., Nielsen F., Barbaresco F. Computer Vision. Volume 10589. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2017. Natural Langevin Dynamics for Neural Networks; pp. 451–459. [Google Scholar]
  • 8.Ollivier Y. True Asymptotic Natural Gradient Optimization. arXiv. 20171712.08449 [Google Scholar]
  • 9.Amari S., Karakida R., Oizumi M. Fisher information and natural gradient learning in random deep networks; Proceedings of the AISTATS 2019; Okinawa, Japan. 16–18 April 2019. [Google Scholar]
  • 10.Zhang G., Martens J., Grosse R. Fast convergence of natural gradient descent for over parametrized neural networks. arXiv. 20191905.10961v1 [Google Scholar]
  • 11.Amari S.-I. Any Target Function Exists in a Neighborhood of Any Sufficiently Wide Random Network: A Geometrical Perspective. arXiv preprint. 2020 doi: 10.1162/neco_a_01295.2001.06931 [DOI] [PubMed] [Google Scholar]
  • 12.Kirillov A.A. Elements of the Theory of Representations. Volume 220 Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 1976. [Google Scholar]
  • 13.Kosmann-Schwarzbach Y. Groupes et Symétries Groupes Finis, Groupes et Algèbres de Lie, Representations. Ecole Polytechnique; Palaiseau, France: 2006. [Google Scholar]
  • 14.Kosmann-Schwarzbach Y. Siméon-Denis Poisson, Les Mathématiques au Service de la Science. Ecole Polytechnique; Palaiseau, France: 2013. [Google Scholar]
  • 15.Laurent-Gengoux C., Pichereau A., Vanhaecke P. Poisson Structures, Grundlehren der Mathematischen Wissenschaften. Springer; Berlin/Heidelberg, Germany: 2013. [Google Scholar]
  • 16.Cartier P. Groupoïdes de Lie et leurs algébroïdes. Séminaire BOURBAKI. ASTÉRISQUE. [(accessed on 6 June 2020)];2009 326 Available online: http://preprints.ihes.fr/2008/M/M-08-20.pdf. [Google Scholar]
  • 17.Barbaresco F., Gay-Balmaz F. Lie Group Cohomology and (Multi)Symplectic Integrators: New Geometric Tools for Lie Group Machine Learning Based on Souriau Geometric Statistical Mechanics. Entropy. 2020;22:498. doi: 10.3390/e22050498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.HYPERBOLIC DEEP LEARNING: A Nascent and Promising Field. [(accessed on 2 June 2020)]; Available online: http://hyperbolicdeeplearning.com/
  • 19.Fakhri H. Su(1, 1)-Barut–Girardello coherent states for Landau levels. J. Phys. A Math. Gen. 2004;37:5203–5210. doi: 10.1088/0305-4470/37/19/007. [DOI] [Google Scholar]
  • 20.Gazeau J.-P. Coherent States in Quantum Optics: An Oriented Overview. arXiv preprint. 20181810.06473 [Google Scholar]
  • 21.Citti G., Sarti A. A Cortical Based Model of Perceptual Completion in the Roto-Translation Space. J. Math. Imaging Vis. 2006;24:307–326. doi: 10.1007/s10851-005-3630-2. [DOI] [Google Scholar]
  • 22.Miolane N. Ph.D. Thesis. INRIA Sophia Antipolis; Valbonne, France: Stanford University; Stanford, CA, USA: 2016. Geometric Statistics for Computational Anatomy. [Google Scholar]
  • 23.Verblunsky S. On Positive Harmonic Functions: A Contribution to the Algebra of Fourier Series. Proc. Lond. Math. Soc. 1935;38:125–157. doi: 10.1112/plms/s2-38.1.125. [DOI] [Google Scholar]
  • 24.Verblunsky S. On positive harmonic functions (second paper) Proc. Lond. Math. Soc. 1936;40:290–320. doi: 10.1112/plms/s2-40.1.290. [DOI] [Google Scholar]
  • 25.Trench W.F. An Algorithm for the Inversion of Finite Toeplitz Matrices. J. Soc. Ind. Appl. Math. 1964;12:515–522. doi: 10.1137/0112045. [DOI] [Google Scholar]
  • 26.Barfoot T.D., Furgale P. Associating Uncertainty with Three-Dimensional Poses for Use in Estimation Problems. IEEE Trans. Robot. 2014;30:679–693. doi: 10.1109/TRO.2014.2298059. [DOI] [Google Scholar]
  • 27.Barrau A., Bonnabel S. Intrinsic Filtering on Lie Groups With Applications to Attitude Estimation. IEEE Trans. Autom. Control. 2014;60:436–449. doi: 10.1109/TAC.2014.2342911. [DOI] [Google Scholar]
  • 28.Chirikjian G.S. Stochastic Models, Information Theory, and Lie Groups, Volume 1: Classical Results and Geometric Methods. Applied and Numerical Harmonic Analysis. Birkhäuser; Basel, Switzerland: 2009. [DOI] [Google Scholar]
  • 29.Gregory S.C. Stochastic Models, Information Theory, and Lie Groups, Volume 2: Analytic Methods and Modern Applications. Springer Science & Business Media; Berlin/Heidelberg, Germany: 2011. [Google Scholar]
  • 30.Gromov M. In a Search for a Structure, Part 1: On Entropy; Proceedings of the European Congress of Mathematics; Kraków, Poland. 2–7 July 2012; pp. 51–78. [Google Scholar]
  • 31.Gromov M. Six Lectures on Probabiliy, Symmetry, Linearity. Jussieu. 2014:preprint. [Google Scholar]
  • 32.Arnaudon A., De Castro A.L., Holm D. Noise and Dissipation on Coadjoint Orbits. J. Nonlinear Sci. 2017;28:91–145. doi: 10.1007/s00332-017-9404-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Benenti S., Tulczyjew W.M. Cocycles of the coadjoint representation of a Lie group interpreted as differential forms. Mem. Accad. Sci. Torino. 1986;10:117–138. [Google Scholar]
  • 34.Benenti S., Tulczyjew W.M. A geometrical interpretation of the 1-cocycles of a Lie group. In: Prastaro A., editor. Geometrodynamics. World Scientific Publishing Co.; Singapore: 1985. pp. 3–24. [Google Scholar]
  • 35.Nencka H., Streater R.F. INFORMATION GEOMETRY FOR SOME LIE ALGEBRAS. Infin. Dimens. Anal. Quantum Probab. Relat. Top. 1999;2:441–460. doi: 10.1142/S0219025799000254. [DOI] [Google Scholar]
  • 36.Souriau J.-M. Géométrie symplectique et physique mathématique. Éditions du C.N.R.S; Aix-en-Provence, France: 1974. Mécanique statistique, groupes de Lie et cosmologie; pp. 59–113. [Google Scholar]
  • 37.Davis M.S. Homogeneous Symplectic Manifolds of the Galilei Group. Georgia Southern University; Statesboro, GA, USA: 2012. (Under the Direction of Francois Ziegler) [Google Scholar]
  • 38.Marle C.M. Géométrie Symplectique et Géométrie de Poisson. Calvage & Mounet; Paris, France: 2018. [Google Scholar]
  • 39.Vandebogert K. Notes on Symplectic Geometry. University of South Carolina; Columbia, SC, USA: Sep 3, 2017. [Google Scholar]
  • 40.Cartier P. Some Fundamental Techniques in the Theory of Integrable Systems, IHES/M/94/23, SW9421. [(accessed on 31 May 2020)];1994 Available online: https://cds.cern.ch/record/263222/files/P00023319.pdf.
  • 41.Bismut J.-M., Labourie F. Symplectic geometry and the Verlinde Formulas. Surv. Differ. Geom. 1999;5:97–311. doi: 10.4310/SDG.1999.v5.n1.a2. [DOI] [Google Scholar]
  • 42.Koszul J.-L., Zou Y.M. Introduction to Symplectic Geometry. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2019. [Google Scholar]
  • 43.Knapp A.W. Representation Theory of Semisimple Groups. Walter de Gruyter GmbH; Berlin, Germany: 1986. [Google Scholar]
  • 44.Frenkel I.B. Orbital theory for affine Lie algebras. Inven. Math. 1984;77:301–352. doi: 10.1007/BF01388449. [DOI] [Google Scholar]
  • 45.Libine M. Introduction to Representations of Real Semisimple Lie Groups. arXiv. 20141212.2578v2 [Google Scholar]
  • 46.Clerc J.-L., Ørsted B. The Maslov index revisited. Transform. Groups. 2001;6:303–320. doi: 10.1007/BF01237249. [DOI] [Google Scholar]
  • 47.Foth P., Lamb M. The Poisson geometry of SU(1,1) J. Math. Phys. 2010;51:092701. doi: 10.1063/1.3461878. [DOI] [Google Scholar]
  • 48.Perelomov A.M. Coherent states for arbitrary Lie group. Commun. Math. Phys. 1972;26:222–236. doi: 10.1007/BF01645091. [DOI] [Google Scholar]
  • 49.Hashimoto T., Ogura K., Okamoto K., Sawae R., Yasunaga Y. Kirillov-Konstant theory and Feynman path integrals on coadjoint orbits I. Hokkaido Math. J. 1991;20:353–405. doi: 10.14492/hokmj/1381413847. [DOI] [Google Scholar]
  • 50.Hashimoto T., Ogura K., Okamoto K., Sawae R. Kirillov Konstant theory and Feynman path integrals on coadjoint orbits of SU(2) and SU(1, 1) Int. J. Mod Phys. 1992;A7(Suppl. 1A):377–390. doi: 10.1142/S0217751X92003859. [DOI] [Google Scholar]
  • 51.Hashimoto T., Ogura K., Okamoto K., Sawae R. Borel-Weil theory and Feynman path integrals on flag manifolds. Hiroshima Math. J. 1993;23:231–247. doi: 10.32917/hmj/1206128252. [DOI] [Google Scholar]
  • 52.Hashimoto T. KirillovKonstant theory and Feynman path integrals on coadjoint orbits of a certain real semisimple Lie group. Hiroshima Math. J. 1993;23:607–627. doi: 10.32917/hmj/1206392785. [DOI] [Google Scholar]
  • 53.Cishahayo C., De Bièvre S. On the contraction of the discrete series of SU(1,1) Ann. Inst. Fourier. 1993;43:551–567. doi: 10.5802/aif.1346. [DOI] [Google Scholar]
  • 54.Cahen B. Contraction de SU(1,1) vers le Groupe de Heisenberg. Travaux Mathématiques. 2004;XV:19–43. [Google Scholar]
  • 55.Cahen M., Gutt S., Rawnsley J. Quantization on Kähler manifolds I, Geometric interpretation of Berezin quantization. J. Geom. Phys. 1990;7:45–62. [Google Scholar]
  • 56.Marle C.-M. Projection Stéréographique et Moments, Hal-02157930, Version 1. [(accessed on 31 May 2020)];2019 Jun; Available online: https://hal.archives-ouvertes.fr/hal-02157930/
  • 57.Guichardet A. Leçons de Mathématiques d’Aujourd’hui. Volume 4. Cassini; Paris, France: 2010. La methode des orbites: Historiques, principes, résultats; pp. 33–59. [Google Scholar]
  • 58.Vergne M. Emmy Noether in Bryn Mawr. Springer; Berlin/Heidelberg, Germany: 1983. Representations of Lie Groups and the Orbit Method; pp. 59–101. [DOI] [Google Scholar]
  • 59.Duflo M., Heckman G., Vergne M. Projection d’orbites, formule de Kirillov et formule de Blattner. Mémoires Société Mathématique Fr. 1984;1:65–128. doi: 10.24033/msmf.300. [DOI] [Google Scholar]
  • 60.Pukanszky L. The Plancherel formula for the universal covering group of SL(R, 2) Math. Ann. 1964;156:96–143. doi: 10.1007/BF01359927. [DOI] [Google Scholar]
  • 61.Pukanszky L. Leçons sur les Représentations des Groupes, Monographies de la Société Mathématique de France. Dunod; Paris, France: 1967. [Google Scholar]
  • 62.Bernat P. Représentations des Groupes de Lie, Monographie de la Société Mathématique de France. Dunod; Paris, France: 1972. [Google Scholar]
  • 63.Dixmier J. Enveloping Algebras. American Mathematical Society; Providence, RI, USA: 1996. [Google Scholar]
  • 64.Duflo M. Construction De Representations Unitaires D’un Groupe De Lie. In: Talamanca A.F., editor. Harmonic Analysis and Group Representation. Volume 82 Springer; Berlin/Heidelberg, Germany: 2010. C.I.M.E. Summer Schools 1980. [Google Scholar]
  • 65.Guichardet A. Théorie de Mackey et méthode des orbites selon M. Duflo. Expo. Math. 1985;3:303–346. [Google Scholar]
  • 66.Mnemné R., Testard F. Groupes de Lie Classiques. Hermann; Paris, France: 1985. [Google Scholar]
  • 67.Rais M. Orbites Coadjointes et Représentations des Groupes, Cours. C.I.M.P.A.; Blagnac/Toulouse, France: 1980. [Google Scholar]
  • 68.Rais M. La représentation coadjointe du groupe affine. Ann. Inst. Fourier. 1978;28:207–237. doi: 10.5802/aif.686. [DOI] [Google Scholar]
  • 69.De Micheli E. On the Connection between Spherical Laplace Transform and Non-Euclidean Fourier Analysis. Mathematics. 2020;8:287. doi: 10.3390/math8020287. [DOI] [Google Scholar]
  • 70.Marsden J.E., Misiolek G., Ortega J.-P., Perlmutter M., Ratiu T.S. Hamiltonian Reduction by Stages. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2007. [Google Scholar]
  • 71.Kurnyavko O.L.I., Shirokov V. Algebraic method for construction of infinitesimal invariants of Lie groups representations. arXiv. 20171710.07977 [Google Scholar]
  • 72.Kurnyavko O.L., Shirokov I.V. Construction of invariants of the coadjoint representation of Lie groups using linear algebra methods. Theor. Math. Phys. 2016;188:965–979. doi: 10.1134/S0040577916070011. [DOI] [Google Scholar]
  • 73.Shirokov I.V. Differential invariants of the transformation group of a homogeneous space. Sib. Math. J. 2007;48:1127–1140. doi: 10.1007/s11202-007-0114-1. [DOI] [Google Scholar]
  • 74.Goncharovskii M.M., Shirokov I.V. Differential invariants and operators of invariant differentiation of the projectable action of Lie groups. Theor. Math. Phys. 2015;183:619–636. doi: 10.1007/s11232-015-0285-z. [DOI] [Google Scholar]
  • 75.Shirokov I.V. Darboux coordinates onK-orbits and the spectra of Casimir operators on lie groups. Theor. Math. Phys. 2000;123:754–767. doi: 10.1007/BF02551030. [DOI] [Google Scholar]
  • 76.Holm D. Variational principles for stochastic fluid dynamics. Proc. R. Soc. A Math. Phys. Eng. Sci. 2015;471:20140963. doi: 10.1098/rspa.2014.0963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Gay-Balmaz F., Holm D. Selective decay by Casimir dissipation in inviscid fluids. Nonlinearity. 2013;26:495–524. doi: 10.1088/0951-7715/26/2/495. [DOI] [Google Scholar]
  • 78.Gay-Balmaz F., Holm D. A geometric theory of selective decay with applications in MHD. Nonlinearity. 2014;27:1747–1777. doi: 10.1088/0951-7715/27/8/1747. [DOI] [Google Scholar]
  • 79.Casimir H.G.B. Uber die konstruktion einer zu den irreduziblen darstellungen halbeinfacher kontinuierlicher gruppen gehörigen differentialgleichung. Proc. R. Soc. Amst. 1931;34:844–846. [Google Scholar]
  • 80.Racah G. Sulla caratterizzazione delle rappresentazioni irriducibili dei gruppi semisemplici di Lie. Atti Accad. Naz. Lincei. Rend. Cl. Sci. Fis. Mat. Nat. 1950;8:108–112. [Google Scholar]
  • 81.Oller P.J. Oller Moving frames and differential invariants in centro-affine geometry. Lobachevskii J. Math. 2010;31:77–89. doi: 10.1134/S1995080210020010. [DOI] [Google Scholar]
  • 82.Berezin D.V. Invariants of the co-adjoint representation for Lie algebras of a special form. Russ. Math. Surv. 1996;51:137–139. doi: 10.1070/RM1996v051n01ABEH002751. [DOI] [Google Scholar]
  • 83.Abellanas L. A general setting for Casimir invariants. J. Math. Phys. 1975;16:1580. doi: 10.1063/1.522727. [DOI] [Google Scholar]
  • 84.Beltrametti E., Blasi A. On the number of Casimir operators associated with any lie group. Phys. Lett. 1966;20:62–64. doi: 10.1016/0031-9163(66)91048-1. [DOI] [Google Scholar]
  • 85.Pecina-Cruz J.N. An algorithm to calculate the invariants of any Lie algebra. J. Math. Phys. 1994;35:3146–3162. doi: 10.1063/1.530458. [DOI] [Google Scholar]
  • 86.Dixmier J. Algèbres enveloppantes (Cahiers Scientifiques. No. 37) Gauthier-Villars; Paris, France: 1974. [Google Scholar]
  • 87.Mikheyev V.V., Shirokov I.V. Application of coadjoint orbits in the thermodynamics of non-compact manifolds. Electron. J. Theor. Phys. 2005;2:1–10. [Google Scholar]
  • 88.Mikheyev V., Nielsen F., Barbaresco F. Applications of Evolutionary Computation. Volume 10589. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2017. Method of Orbits of Co-Associated Representation in Thermodynamics of the Lie Non-compact Groups; pp. 425–431. [Google Scholar]
  • 89.Fomenko A.T., Trofimov V.V. Integrable Systems on Lie Algebras and Symmetric Spaces. Gordon and Breach Science Publishers; New York, NY, USA: 1988. [Google Scholar]
  • 90.Trofimov V.V. Introduction to Geometry of Manifolds with Symmetry. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 1994. [Google Scholar]
  • 91.Machon T. The Godbillon–Vey invariant as a restricted Casimir of three-dimensional ideal fluids. J. Phys. A Math. Theor. 2020;53:235701. doi: 10.1088/1751-8121/ab8b35. [DOI] [Google Scholar]
  • 92.Casimir H.B.G. On Onsager’s Principle of Microscopic Reversibility. Rev. Mod. Phys. 1945;17:343. doi: 10.1103/RevModPhys.17.343. [DOI] [Google Scholar]
  • 93.Thiffeault J.-L., Morrison P. Classification and Casimir invariants of Lie–Poisson brackets. Phys. D Nonlinear Phenom. 2000;136:205–244. doi: 10.1016/S0167-2789(99)00155-4. [DOI] [Google Scholar]
  • 94.Arnaudon A., De Castro A.L., Holm D.D. Noise and dissipation in rigid body motion. arXiv. 2016 Jun 20;1606.06308v1 [Google Scholar]
  • 95.Bargmann V. Irreducible Unitary Representations of the Lorentz Group. Ann. Math. 1947;48:568. doi: 10.2307/1969129. [DOI] [Google Scholar]
  • 96.Souriau J.-M. Mécanique Statistique, Groupes de Lie et Cosmologie. [(accessed on 20 April 2020)];2020 Available online: https://www.academia.edu/42630654/Statistical_Mechanics_Lie_Group_and_Cosmology_1_st_part_Symplectic_Model_of_Statistical_Mechanics.
  • 97.Souriau J.-M. Structure des Systèmes Dynamiques. Dunod; Paris, France: 1969. [Google Scholar]
  • 98.Souriau J.M. Mécanique Classique et Géométrie Symplectique, Rapport CNRS CPT-84/PE.1695. Université de Provence et Centre de Physique Théorique CNRS; Marseille, France: 1984. [Google Scholar]
  • 99.Souriau J.M. Equations Canoniques et Géométrie Symplectique. Pub. Sci. Univ. Alger. Sér. A. 1954;1:239–265. [Google Scholar]
  • 100.Souriau J.M. Géométrie de l’Espace des Phases, Calcul des Variations et Mécanique Quantique, Tirage Ronéotypé. Faculté des Sciences; Marseille, France: 1965. [Google Scholar]
  • 101.Souriau J.-M. Realisations d’algèbres de Lie au moyen de variables dynamiques. Il Nuovo Cim. A. 1967;49:197–198. doi: 10.1007/BF02739084. [DOI] [Google Scholar]
  • 102.Marle C.-M. From Tools in Symplectic and Poisson Geometry to J.-M. Souriau’s Theories of Statistical Mechanics and Thermodynamics. Entropy. 2016;18:370. doi: 10.3390/e18100370. [DOI] [Google Scholar]
  • 103.Barbaresco F. Higher Order Geometric Theory of Information and Heat Based on Poly-Symplectic Geometry of Souriau Lie Groups Thermodynamics and Their Contextures: The Bedrock for Lie Group Machine Learning. Entropy. 2018;20:840. doi: 10.3390/e20110840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Barbaresco F. Souriau Exponential Map Algorithm for Machine Learning on Matrix Lie Groups. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2019. pp. 85–95. [Google Scholar]
  • 105.Barbaresco F. Geometric Theory of Heat from Souriau Lie Groups Thermodynamics and Koszul Hessian Geometry: Applications in Information Geometry for Exponential Families. Entropy. 2016;18:386. doi: 10.3390/e18110386. [DOI] [Google Scholar]
  • 106.Barbaresco F. Applications of Evolutionary Computation. Volume 11712. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2019. Lie Group Machine Learning and Gibbs Density on Poincaré Unit Disk from Souriau Lie Groups Thermodynamics and SU(1,1) Coadjoint Orbits; pp. 157–170. [Google Scholar]
  • 107.Barbaresco F. Application exponentielle de matrice par l’extension de l’algorithme de Jean-Marie Souriau, utilisable pour le tir géodésique et l’apprentissage machine pour les groupes de Lie; Proceedings of the Colloque GRETSI 2019; Lille, France. 26–29 August 2019. [Google Scholar]
  • 108.Ishi H., Kolodziejek B. Characterization of the Riesz Exponential Familly on Homogeneous Cones. arXiv. 20181605.03896 [Google Scholar]
  • 109.Tojo K., Yoshino T. On a Method to Construct Exponential Families by Representation Theory. arXiv. 20181811.01394 [Google Scholar]
  • 110.Tojo K., Yoshino T. Applications of Evolutionary Computation. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2019. On a Method to Construct Exponential Families by Representation Theory; pp. 147–156. [Google Scholar]
  • 111.Tojo K., Yoshino T. Harmonic exponential families on homogeneous spaces. 2020. preprint.
  • 112.Arnold V. Sur la géométrie différentielle des groupes de Lie de dimension infinie et ses applications à l’hydrodynamique des fluides parfaits. Ann. Inst. Fourier. 1966;16:319–361. doi: 10.5802/aif.233. [DOI] [Google Scholar]
  • 113.Arnold V.I., Givental A.B. Symplectic Geometry. In: Arnol’d V.I., Novikov S.P., editors. Dynamical Systems IV: Symplectic Geometry and Its Applications. Volume 4. Springer; Berlin, Gemany: 1990. pp. 1–136. Encyclopaedia of Mathematical Sciences. [Google Scholar]
  • 114.Balian R., Alhassid Y., Reinhardt H. Dissipation in many-body systems: A geometric approach based on information theory. Phys. Rep. 1986;131:1–146. doi: 10.1016/0370-1573(86)90005-0. [DOI] [Google Scholar]
  • 115.Balian R., Balazs N. Equiprobability, inference, and entropy in quantum theory. Ann. Phys. 1987;179:97–144. doi: 10.1016/S0003-4916(87)80006-4. [DOI] [Google Scholar]
  • 116.Balian R. On the principles of quantum mechanics and the reduction of the wave packet. Am. J. Phys. 1989;57:1019. doi: 10.1119/1.15813. [DOI] [Google Scholar]
  • 117.Balian R. From Microphysics to Macrophysics. Volume 1–2 Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 1991. [Google Scholar]
  • 118.Balian R. Incomplete descriptions and relevant entropies. Am. J. Phys. 1999;67:1078–1090. doi: 10.1119/1.19086. [DOI] [Google Scholar]
  • 119.Balian R., Valentin P. Hamiltonian structure of thermodynamics with gauge. Eur. Phys. J. B. 2001;21:269–282. doi: 10.1007/s100510170202. [DOI] [Google Scholar]
  • 120.Balian R. Poincaré Seminar 2003. Springer Science and Business Media LLC; Berlin/Heidelberg, Germany: 2004. Entropy, a Protean Concept; pp. 119–144. [Google Scholar]
  • 121.Balian R. Information in statistical physics. Stud. Hist. Philos. Sci. Part B Stud. Hist. Philos. Mod. Phys. 2005;36:323–353. doi: 10.1016/j.shpsb.2005.02.001. [DOI] [Google Scholar]
  • 122.Balian R. The Entropy-Based Quantum Metric. Entropy. 2014;16:3878–3888. doi: 10.3390/e16073878. [DOI] [Google Scholar]
  • 123.Balian R. François Massieu et les Potentiels Thermodynamiques, Évolution des Disciplines et Histoire des Découvertes. Académie des Sciences; Paris, France: 2015. [Google Scholar]
  • 124.Berthet Q., Blondel M., Teboul O., Cuturi M., Vert J.-P., Bach F. Learning with Differentiable Perturbed Optimizers. arXiv preprint. 20022002.08676 [Google Scholar]
  • 125.Blondel M., Martins A.F.T., Niculae V. Learning with Fenchel-Young Losses. J. Mach. Learn. Res. 2020;21:1–69. [Google Scholar]
  • 126.Wainwright M.J., Jordan M.I. Graphical Models, Exponential Families, and Variational Inference. Found. Trends Mach. Learn. 2007;1:1–305. doi: 10.1561/2200000001. [DOI] [Google Scholar]
  • 127.Yahyai M. Ph.D. Thesis. Université de Metz; Metz, France: Jun 23, 1995. Représentations Étoile du Revêtement Universel du Groupe Hyperbolique et Formule de Plancherel. [Google Scholar]
  • 128.Bertrand J., Irac-Astaud M. Characterization of su(1,1) coherent states in terms of affine wavelets. J. Phys. A Math. Gen. 2002;35:7347–7357. doi: 10.1088/0305-4470/35/34/308. [DOI] [Google Scholar]

Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES