Skip to main content
Entropy logoLink to Entropy
. 2021 Feb 10;23(2):215. doi: 10.3390/e23020215

The Principle of Covariance and the Hamiltonian Formulation of General Relativity

Massimo Tessarotto 1,2, Claudio Cremaschini 2,*
Editor: Alberto Porzio
PMCID: PMC7916769  PMID: 33579013

Abstract

The implications of the general covariance principle for the establishment of a Hamiltonian variational formulation of classical General Relativity are addressed. The analysis is performed in the framework of the Einstein-Hilbert variational theory. Preliminarily, customary Lagrangian variational principles are reviewed, pointing out the existence of a novel variational formulation in which the class of variations remains unconstrained. As a second step, the conditions of validity of the non-manifestly covariant ADM variational theory are questioned. The main result concerns the proof of its intrinsic non-Hamiltonian character and the failure of this approach in providing a symplectic structure of space-time. In contrast, it is demonstrated that a solution reconciling the physical requirements of covariance and manifest covariance of variational theory with the existence of a classical Hamiltonian structure for the gravitational field can be reached in the framework of synchronous variational principles. Both path-integral and volume-integral realizations of the Hamilton variational principle are explicitly determined and the corresponding physical interpretations are pointed out.

Keywords: Einstein-Hilbert variational principle, Hamiltonian theory of GR, ADM Hamiltonian theory, manifest covariance

PACS: 03.50.-z, 04.20.-q, 04.20.Cv, 04.20.Fy

1. Introduction

Despite being criticized in general terms by some of his contemporaries for its explicitly non-manifestly covariant approach both in classical and quantum gravity (Hawking [1], Isham [2]), nevertheless the so-called ADM Hamiltonian theory (Arnowitt, Deser and Misner [3,4]) has never been seriously challenged—possibly because of its complexity—in its mathematical and physical setup. In the following we present an attempt to do so in an exhaustive way. For this purpose, however, we shall preliminarily address the corresponding Lagrangian theory for the Einstein field equations (EFE) [5,6,7]. The review of corresponding literature approaches is useful to display the existence of a novel variational formulation whose validity is actually instrumental for the construction of a full, that is, Hamiltonian, variational theory of General Relativity (GR). Contrary to the customary Einstein-Hilbert (EH) setting, the key feature of such variational approach is that of being constraint-free, that is, subject to a so called “extremal constraint” and to be based on the identification of the variation operator, as customary in mathematical physics, in terms of the Frechet derivative. Indeed, its validity does not prevent the existence of independent (and thus unconstrained) variations of the (relevant) Lagrangian field variables. As far as ADM theory is concerned, in particular, the goal is to point out some of its most relevant critical aspects and inadequacies, which include in particular its obvious intrinsic non-manifestly covariant character. We anticipate, however, that the really crucial conclusion which is reached here is that, ADM theory does not provide, in a proper mathematical sense, a truly Hamiltonian theory of GR. Unfortunately this does not appear just a minor nuisance or an unfortunate minor accident.

The main goal of the paper thus remains that of seeking a possible way-out, that is, to determine a Hamiltonian theory of GR in a proper sense, namely in which the corresponding dynamical (i.e., Euler-Lagrange) equations are identified with ordinary differential equations and which, besides being manifestly-covariant, is also constraint-free and gauge-dependent. As we intend to show, such a theory is rooted on the classical manifestly-covariant Hamiltonian theory of GR recently developed in a series of papers [8,9,10]. For this purpose, here we intend to formulate in detail the appropriate variational principles, displaying explicitly their connection with EFE, their mutual relationship as well as their role for the Hamiltonian formulation of GR.

Motivations and Outlook

The property of general covariance of physical laws and its explicit functional realization, namely the corresponding Principle of Manifest Covariance (PMC), represent a fundamental aspect of theoretical physics and mathematical physics, which lays at the basis of the conceptual framework of General Relativity and provides a prerequisite of consistency for the establishment of relativistic field theories (RFT). According to PMC, in particular, it should always be possible to cast the physical laws of RFT in 4-tensor form with respect to the group of local point transformations which preserve the structure of space-time [11]. The validity of such a principle represents undoubtedly a severe constraint on the admissible physical theories, requiring in many cases the need of an overhaul, or even a complete reformulation, of existing theories. In fact it provides strict guidelines and recipes on the formal representation of mathematical relationships holding among the same laws, but also encodes a deeper meaning related to the admissible realization (and possible universal character) of the relevant equations, physical parameters, classical or quantum phase-functions and observables. In this sense, general covariance affects both the classical and quantum description of continuum fields in RFT, for the search respectively of appropriate variational (i.e., Lagrangian and Hamiltonian) formulations for classical field theories and of canonical quantization methods for the corresponding quantum field theories.

The general covariance principle must therefore be interpreted as a paramount guiding principle for the consistent variational formulation of classical equations of General Relativity and the establishment of a quantization approach yielding a consistent covariant quantum gravity theory, possibly including the combined effect due to classical or quantum source fields, like the electromagnetic or scalar fields to be described on the same ground.

The subject of this paper is the investigation of the validity of the principle of covariance for the variational formulation of classical General Relativity and its implications for the establishment of a Hamiltonian theory for the Einstein field equations (EFE). The focus here is on the conceptual and mathematical meaning of the principle of general and manifest covariance in classical General Relativity, with particular attention to the identification of the Hamiltonian structure underlying the Einstein field equations and the validity of corresponding Hamiltonian variational principles. This target requires necessarily a critical analysis of the conditions of validity of the Hamiltonian formulation of GR available in existing literature, which is realized by the well-known 3 + 1 ADM theory, in relation to the variational Lagrangian formulation of GR, as well as the establishment of the problematic limitations of such approach in connection with the principles of covariance and manifest covariance. More precisely, three main issues arise in this procedure, which pertain:

(1) A critical analysis of the Einstein-Hilbert Lagrangian variational principle that determines EFE in manifestly-covariant form as extremal Euler-Lagrange equations, when the corresponding Lagrangian function is identified with the Ricci curvature 4-scalar in which the variational field variable is represented by the space-time metric tensor gμν. In fact, the notorious difficulty which characterizes this approach to the Lagrangian formulation of GR arises due to the non-canonical structure of the E-H Lagrangian function that carries second-order partial derivatives of the variational metric tensor contained in the Ricci tensor. As such, distinctive realizations can be envisaged for the same variational principle, differing for the functional dependence of the Ricci curvature tensor on the variational fields and the way in which its variation is treated. The goal of the investigation is to show that the approaches available in the literature can be classified as asynchronous boundary-constrained variational principles for the EH theory. In this framework, the contributions generated by variations of the metric tensor carried by the Ricci tensor cannot contribute to the extremal equations if the latter are to be identified with EFE. Therefore, this necessarily requires to set up appropriate boundary constraints that warrant the drop of such undesirable variational terms. In contrast, a novel formulation is pointed out, which realizes an unconstrained form free of previous boundary constraints and that exhibits also the remarkable features of restoring the canonical form for the Lagrangian function (which depends on field variable and at most on its first-order derivative), of satisfying the principle of covariance and of yielding EFE as extremal equations.

(2) The investigation of the ADM variational formulation (briefly ADM theory) obtained by implementing a 3 + 1 splitting of space-time in terms of so-called ADM field variables and its relationship with the Einstein-Hilbert variational approach in connection with the Hamiltonian formulation of GR. A crucial aspect of ADM theory lies, in fact, in the violation of the principle of manifest covariance due to the peculiar choice of reference system in which time and space coordinates are no longer treated on equal footing, and in particular where a suitable coordinate-time is singled out to play the role of dynamical (i.e., evolution) parameter for the theory. In the literature this peculiar aspect is regarded as an intrinsic property of Hamiltonian theories for continuum fields, in contrast to the manifestly-covariant form of corresponding Lagrangian formulations. For this reason, in the scientific community the implementation of ADM theory in the E-H variational principle is recognized as the viable route that leads to the definition of a 3-dimensional constrained Hamiltonian representation of EFE. However, based on the results pointed out in (1), a deeper insightful critical analysis of the Hamiltonian character of the Einstein equations is required in order to question the foundation of the subject. The strategy that is followed consists in expressing both the boundary-constrained and unconstrained Lagrangian principles in terms of the corresponding non-manifestly covariant ADM frameworks and then to require the simultaneous validity of the resulting ADM variational equations and their equivalence with EFE. This yields the main result of the work, which concerns the proof of the non-Hamiltonian character of ADM theory and the failure of this approach in providing a symplectic structure of space-time. Contrary to the common belief, it is not sufficient to implement a suitable coordinate system where a time variable is distinguished from space variables in order to obtain a Hamiltonian field theory. Indeed, the ADM Hamiltonian theory does not represent an intrinsic Hamiltonian structure of space-time, but only an apparent one. The reason is that also the ADM formulation remains rooted on the non-canonical structure of the E-H Lagrangian, and it still carries second-order derivatives of the generalized coordinate field, an aspect which is sufficient to prevent the definition of canonical Hamiltonian theory.

(3) The search and construction of a new Hamiltonian representation of EFE which can overcome the conceptual limitations of the ADM setting and be simultaneously globally valid, unconstrained and manifestly covariant. In particular, it is proved that a solution reconciling the physical requirements of covariance and manifest covariance of variational theory with the existence of a classical Hamiltonian structure for the gravitational field corresponding to a true symplectic structure of space-time can be effectively reached in the framework of synchronous variational principles, based on the outcomes earlier provided by Refs. [8,9,10]. In order to afford a straightforward comparison with the ADM setting, the parametrization in terms of an invariant proper-time parameter is first introduced. This allows in turn to construct synchronous Hamiltonian variational principles expressed either by path-integral or volume-integral realizations. Remarkably, both representations yield manifestly-covariant Hamilton equations in canonical form, namely expressed as evolution equations in terms of the proper-time dynamical parameter, which under prescription of suitable initial conditions for the canonical state recover identically EFE. The novelty of the theory lies also in the corresponding physical interpretation that is inferred about the role of the Ricci 4-scalar in the EH Lagrangian function, which motivates and supports in turn this kind of approach. In fact, consistent with the principle of manifest covariance, within the framework of the synchronous Hamiltonian variational principle the latter should be understood as a potential term expressing the coupling between the variational metric tensor and the Ricci tensor carrying information about the geometry of the background space-time.

The subject of the present investigation summarized in points (1)–(3) listed above can establish a reference framework for the theoretical foundations of GR Lagrangian and Hamiltonian variational formulations. Conceptual implications of the theory involve both classical GR as well as quantum gravity, namely perspective covariant canonical quantization approaches for the gravitational field based on the same Hamiltonian theory.

2. Asynchronous Variational Approaches

The subject of the investigation is therefore centered on the crucial relationship between the covariance principle and the Hamiltonian theory of GR.

To start with we stress that a key issue concerns couching the theory on the appropriate mathematical framework, that is, based on a suitable prescription of the related functional setting and the notion of functional variation. For this purpose a preliminary brief survey is appropriate regarding some of the popular variational principles for GR equations that are customarily adopted in the literature. The reason, as anticipated above, is to point out a realization of the Lagrangian variational principle (for EFE) which, besides being manifestly covariant, is also unconstrained. Such a feature, which departs from previous constrained formulations found in the literature, is of crucial importance. In fact, it turns out to be an essential ingredient for the establishment of a corresponding constraint-free Hamiltonian variational formulation (see Section 4).

The review presented here pertains in particular to the Einstein-Hilbert (EH) Lagrangian variational approach (referred to for brevity as EH theory), originally developed by Einstein himself to determine the actual form of EFE. For this purpose, we consider first the so-called asynchronous variational principle. This case occurs when the functional setting is prescribed so that the 4-scalar volume element associated with the variational functional depends functionally on suitable real and symmetric fields g(r)gμνgμν. The latter are then considered variational, that is, still undetermined 4-tensors. The relevant variational functional which occurs in EH theory is the so-called EH action functional [5]. In the general case of source fields, that is, non-vacuum conditions, this is identified with the 4-scalar functional

SEH(g(r))=Q4d4rgLEH(g)+LF(g), (1)

where the argument g=g(r) denotes the functional dependence in terms of the variational field, while

dΩd4rg (2)

is the invariant 4-scalar volume element of the Riemann spacetime Q4,g(r) and g under the square root denotes the determinant of g(r). Furthermore, LEH and LF, both assumed as 4-scalars, identify the Einstein-Hilbert Lagrangian function of the gravitational field and the Lagrangian of possible additional fields, to be properly specified. By definition, here the function LEH represents the vacuum contribution

LEH=gμνRμν2Λ, (3)

where in standard notation Λ denotes the cosmological constant, while Rμν is the variational Ricci tensor which is considered function of the same variational tensor gμν by means of the Christoffel symbols. As such it carries the second partial derivatives of gμν with respect to the 4-position rμ. Notice that in this framework the variational tensor field g(r)gμν is also a metric tensor associated with the spacetime Q4,g (Q4R4), so that it raises and lowers tensor indices and hence must satisfy suitable orthogonality conditions. This means that its functional setting, including the same conditions, is constrained in the sense that it is necessarily of the form

gCg(r)gextr(r)+δg(r)C2(Q4)gμνgμk=δνk, (4)

with Q4R4 and gC denoting the functional class of “normalized” varied functions g(r). Furthermore, here the 10 covariant and counter-variant components of symmetric tensor field g, namely respectively gμν and gμν, are considered linearly independent, despite being required to fulfill the orthogonality constraints

gμνgμk=δνk, (5)

which imply in turn the “normalization” condition gμνgμν=4. In addition, each varied function g(r) is assumed of the form

g(r)gextr(r)+δg(r), (6)

gextrgextr,μν(r)gextrμν(r) representing a suitably defined extremal real and symmetric tensor field (see below) and δg(r)δgμνδgμν a real and symmetric variation tensor field. We stress here that in the framework of EH theory each g(r)gμν(r)gμν(r) is regarded as a metric tensor field which raises and lowers tensor indices and for each g(r)gμν(r)gμν(r) its countervariant components gμν are by construction prescribed so that the orthogonality conditions (5) are identically satisfied. In addition, the variations δgμν are regarded as linearly independent and possibly subject to suitable boundary conditions (to be applied on appropriate boundaries). Finally, in the same setting the three quantities g, gμν and Rμν are all considered as variational in Equation (1), although g is not by itself a 4-tensor.

2.1. The Einstein-Hilbert Lagrangian Variational Principle

Given these premises, we first address in particular the Einstein-Hilbert (EH) variational principle. In this case the extremal field gextr(r), which enters Equation (6), is identified with the solution of the Einstein field equations and therefore for better clarity is denoted here with the symbol

gextr(r)g^(r)=g^μν(r). (7)

The tensor field g^(r) will be henceforth referred to as “background” metric tensor field. Then, the EH variational principle (or EH action principle) is obtained by prescribing that for arbitrary variations δg(r) it occurs that:

δSEH(g)g=g^(r)=0, (8)

with the symbol δ denoting, as usual, the variation operator. Notice that here the precise mathematical definitions matter. In fact, a (possible) departure from previous literature lies in the definition of the variation operator δ adopted here and the prescription of its functional setting, which in case (8) is identified with gC (see, e.g., Equation (4)). Thus, according to the standard definition available in the mathematical literature [12], for any real smooth function or functional F(g(r)) its functional variation is identified with its Frechet derivative. The definition is based on the representation of a generic varied function g(r) which, thanks to Equation (7), is given now in terms of

g(r)=g^(r)+δg(r). (9)

Furthermore, for definiteness, we denote by g(r,α) the related parametrized varied function

g(r,α)=g^(r)+αδg(r), (10)

with α1,1 being a real 4-scalar parameter to be assumed independent of r. Then, the Frechet derivative of the function (or functional) F(g(r)) is given by

δF(g(r))ddαF(g^(r)+αδg(r))α=0. (11)

The obvious implication is therefore that the variation of the Ricci tensor Rμν when it is considered as a function (or functional) of the variational tensor field g(r), is therefore generally non-zero (for non-vanishing variation δg(r)) because the dependence of the Christoffel symbols Γilm in terms of the variational field g(r) implies that

δΓilm(g(r))ddαΓilm(g^(r)+αδg(r))α=0, (12)

which is therefore different from zero. We stress, on the other hand, that by construction δg^(r)=0, so that if one assumes Γilm=Γilm(g^(r)), then identically

δΓilm(g^(r))0. (13)

Notice, however, that this prescription differs from the so-called Palatini approach [13] in which the Christoffel symbols Γilm are considered independent variables and as such can therefore in principle be required to satisfy the constraint requirement

δΓilm=0 (14)

if the variation δ is performed only with respect to δg(r).

We stress that in Equation (8) the tensor field g^(r) is prescribed in such a way that the same equation is satisfied identically for arbitrary variations δg(r). The variation of all relevant functional or functions can similarly be defined. Thus, while the variation of g^(r) vanishes identically, being independent of α, so that δg^(r)=ddαg^(r)α=0=0, by definition δg=ddαg(r,α)α=0. Hence, the operator δ defined in this way does not preserve the 4-scalar volume element dΩ, since

δdΩd4rδg0, (15)

and for this reason is referred to as asynchronous variational operator. Because of the consequent explicit variational contribution which arises due to the non-constant volume element, and in analogy with the terminology adopted in classical mechanics, the EH variational principle (8) is thus denoted as asynchronous variational principle.

Its implications are in principle straightforward. In fact, by identifying the variational Lagrangian density LvarLvar(g) with

Lvar(g)=gLEH(g)+LF(g), (16)

the variational principle (8) yields the corresponding Euler-Lagrange equations, which in symbolic form (i.e., expressed in terms of the functional derivative with respect to gμν, or equivalently gμν) take the form

δSEHδgμνg=g^(r)Lvar(g)gμνg=g^(r)=0. (17)

Notice that here the solution (g^(r)) of the Euler-Lagrange Equation (17) necessarily coincides with the extremal value of the variational tensor field g, that is, such that Equation (8) is satisfied for arbitrary variations δg(r) belonging to the functional class gC. Nevertheless, the precise evaluation of the Frechet derivative (11) and its related functional derivative, that is, Equation (17), are not straightforward, the reason being that the Ricci tensor appearing in LEH(g) contains second-order partial derivatives of g(r).

2.2. Boundary-Constrained Variational Principle of EH Theory

In order to tackle such a problem it is convenient to distinguish in the EH variational principle (8) two variational contributions, by letting

δSEH(g)=δSEH(g)expl+δSEH(g)impl, (18)

with δSEH(g)expl and δSEH(g)impl denoting respectively the explicit and implicit variations of the action functional SEH(g). The first one is obtained by varying SEH(g) while keeping the Ricci tensor constant, while the second one contains the variation of the Ricci tensor only (see Equations (3) and (6) below). Thus, in the vacuum case with non-vanishing cosmological constant the first contribution contains the explicit contributions coming respectively: (a) from the variation of the determinant entering the 4-scalar volume element (2), that is, yielding its explicit asynchronous variation δdΩ=d4xδg, where the identity

δg=12gμνδgμν (19)

holds; (b) from the explicit contribution carried by gμν in LEH and LF (with LF being assumed independent of Rμν). As a consequence:

δSEH(g)expl=Q4d4rδggμνRμν2Λ+gδgμνRμν+δQ4d4rgLF(g). (20)

Direct evaluation of the various terms on the rhs of Equation (3) yields

δSEH(g)expl=Q4d4rgRμν12RΛgμνδgμνQ4d4rgδgμνκTμν, (21)

which implies

Rμν12RΛgμν=κTμν, (22)

namely, the general form of the non-vacuum Einstein field equations (EFE) for the unknown tensor field gμν(r). Here the notation is standard. Thus, Rμν and R=gμνRμν are respectively the Ricci 4-tensor and Ricci 4-scalar, Λ is the 4-scalar cosmological constant, while Tμν is the stress-energy tensor associated with the external source fields described by the external-field Lagrangian density LF(g). Finally, κ is the universal constant κ=8πG/c4, while G is the Newtonian constant of gravitation with c being the speed of light in vacuum. The 4-tensor Equation (5) represents a second-order, intrinsically non-Hamiltonian PDE, whose solution will be denoted here with the symbol g^(r)=g^μν(r).

Given the result expressed by Equation (5) one therefore expects that the remaining contributions should vanish identically. These are carried by the implicit variation δSEH(g)impl, obtained by varying SEH(g) with respect to the Ricci tensor only, namely:

δSEH(g)impl=Q4d4rggμνδRμν. (23)

The obvious implication is therefore that the variational functional constraint condition

δSEH(g)impl=0 (24)

must necessarily hold in order to recover exactly EFE. As a consequence, the EH variational principle in Equation (8) must be considered as a constrained one, the constraint (24) realizing effectively a boundary-constrained EH variational principle. The peculiar difficulty that arises when trying to satisfy the variational constraint condition (24), however, is due to the fact that, as anticipated, the Ricci tensor contains second-order partial derivatives of the metric tensor, so that on the whole the variational Lagrangian is not written in standard form and one obtains that the implicit term still carries first-order derivatives of the generalized coordinate field gμν. In fact, let us recall the definition of the Ricci scalar and tensor (R and Rμν)

R=gμνRμν=G+gμνΓμνlrlΓμllrν, (25)

where G is a function of gμν of the form

G=Hμν(Γ)gμν,Hμν(Γ)ΓilmΓkmlΓiklΓlmm, (26)

and Γilm are the Christoffel symbols prescribed with respect to gμν. Since by construction αgμν0, explicit calculation gives

Q4d4rgR=Q4d4rgG+Q4d4rgαgβkΓβkαgβαΓβkk. (27)

As a consequence, it follows that in order to satisfy the constraint (24) it should be identically:

δSEH(g)impl=Q4d4rggμνδHμν(Γ)+Q4d4rgαwα=0, (28)

with wα denoting the 4-vector

wα=gβkδΓβkαgβαδΓβkk. (29)

Thus, in particular, the last integral thanks to the divergence theorem reduces to

Q4d4rgαwα=Q4dσnαwα, (30)

with nα denoting the inward unit tensor locally orthogonal to the surface Q4 of the boundary Q4. In the literature various possible tentative solutions have been proposed to satisfy the constraint Equation (28).

Choice A—The first approach is the one proposed by Einstein himself [11] (see also Landau-Lifschitz [5]). The claim is that it should be sufficient to impose that on the boundary Q4 of integration domain the variations δΓikl and δΓlkk are prescribed in such a way that the 4-vector wα is set identically zero, namely

wα|Q4=0. (31)

In fact, according to the same claim, the contribution of the volume integral in the first term on the rhs of Equation (28) should be regarded as null being evaluated in the local frame in which the spacetime is locally flat, so that the tensor function Hμν(Γ) should remain constant with respect to arbitrary variations, so that in a locally flat system

Q4d4rggμνδHμν(Γ)=0. (32)

A criticism arises on this approach. We first notice that the boundary condition (31) is perfectly admissible. It can be viewed, in fact, as a gauge condition to the variational Lagrangian density Lvar(g) (16). Indeed it is always possible to add to Lvar(g) a suitable boundary term contribution which cancels exactly a possible non-vanishing surface boundary term wα|Q40 which might be carried by Lagrangian density.

The argument leading to the proof of Equation (32) is instead of dubious validity. The objection is as follows. Validity of Equation (32) for arbitrary variations δg(r) belonging to g(r)C requires, in turn, that the Christoffel symbols should be almost everywhere such that

δHμν(Γ)=0. (33)

However, for a prescribed metric tensor g(r)=g^(r)+δg(r) and a bounded set Q4, the approximate local constraint condition

δHμν(Γ)=0+O(εγ) (34)

can be fulfilled at most on a countable set of covering ε-neighborhoods Vε(ri) of riQ4, with radius ε (being ε a suitable infinitesimal and γ a suitable constant >1). Nevertheless, such a choice of the coordinate system does not warrant that the same requirement (34) is satisfied for arbitrary variations δg(r) belonging to g(r)C. This implies that Equation (34) cannot be realized by a local point transformation which preserves the differential manifold Q4,g(r).

As a note added in proof we notice that it is possible to show [14] that a diffeomorphism which maps in each other almost everywhere a curved spacetime Q4,g(r) on the flat Minkowski spacetime Q4,η(r) can only be realized by a so-called non-local point transformation (NLPT). Such a transformation, which incidentally is analogous to the one which realizes the so-called Einstein’s teleparallel approach to GR, is however a non-local transformation which is not applicable in the present context. As a consequence, Equation (16) effectively requires validity of the further variational constraint

δHμν(Γ)=0, (35)

which in turn requires Equation (14) to hold identically in the whole spacetime Q4, including (thanks to continuity) the boundary Q4. However, since in the original approach by Einstein the variations δgμν and δΓikl are not considered as independent, as typically assumed instead in Palatini-type approaches, the issue arises of the actual validity of the constraint condition (14) in the framework of the functional setting gC.

We notice that a similar possibility to the one proposed by Landau-Lifschitz is that advanced by Wald in Ref. [7], which concerns Equation (31) only and is based on the claim that the variation of the Ricci tensor should give rise to a divergence term acting on the variation δgμν which should therefore vanish by requiring that the same variation vanishes on the boundary. According to Ref. [7], “this term does not vanish for general variations where gμν is held fixed on the boundary, although it does vanish for variations where the first derivatives of gμν also are held fixed”.

Choice B—A further possibility concerned Equation (31) also advanced by Wald in Ref. [7], is based on the addition of an “ad hoc” extra surface-term contribution to the action functional (1), which is a function of wα and is constructed in such a way to exactly balance the implicit contributions originating from the integration by parts performed in Equation (28). This approach however cannot be implemented to treat the constraint (32).

Choice C—Finally, another possible route, already anticipated above, arises in the context of the Palatini approach [13] provided the functional setting in the EH action functional (1) is properly prescribed. In such a case in fact, in contrast to the prescription (12) given above, the variations δgμν and δΓikl are considered independent by construction (see for example [4] and the original derivation by Schroedinger [15]). Therefore in such a context the constraint condition (14) can in principle be legitimately set.

Regarding the possible choices indicated above there are however two crucial aspects to take into account. The first one is that, in all cases, the action principle based on the Einstein-Hilbert action may not be well posed [16,17]. In fact, since the Lagrangian is represented in terms of the variational Ricci scalar R(g(r)), derivation of the Einstein equations requires fixing both the metric tensor and its first derivative on the boundary, which is inconsistent with the same Einstein equations. The second one concerns Choice B. It refers to the fact that the prescription of boundary terms involves either assigning a suitable GR-frame (i.e., a coordinate setting, such as a foliatiation with respect to the coordinate time) for their representation in terms of variational variables, or to adopt a general representation in terms of connections Γbca that have a non-tensorial character. Both possibilities necessarily violate manifest covariance. We stress that the same inconvenience is expected to arise in the case of alternative theories of GR (for example including torsion or f(R) effects) in which boundary terms are taken into account [18].

Based on the these consideration, apart possibly only the Palatini approach (which, however, adopting Frechet derivatives requires a suitable reformulation not reported here for brevity), the remaining tentative solutions A and B appear therefore questionable. The fundamental question which arises is therefore whether, in the context of the EH action principle (8) and subject to the boundary constraints (24), there is a possible way-out solution which effectively realizes an asynchronous variational principle for EFE and permits a consistent treatment of the whole constraint (28).

2.3. An Unconstrained Lagrangian Variational Principle

As we intend to prove here, the solution of the difficulties indicated above is represented by an unconstrained Lagrangian variational principle for EH theory. Inspired to a concept originally pointed out in Ref. [8], the new approach is represented by a different kind of variational approach which is based on the adoption of so-called “extremal” constraints. As explained below, such a kind of constraints are peculiar because they actually do not limit either the functional class of the variations δg(r) or that of the varied functions g(r), thus leaving both δg(r) and g(r) independent. As a consequence, the resulting variational principle remains, in a proper sense, unconstrained. As a further notable feature, contrary to the original EH setting, the issue of the well-posedness for the related variation principle does not arise here. In fact it is avoided alltogether thanks to the fact that, as explained in detailed below, in such a context the Ricci tensor is considered prescribed (i.e., it is not treated as variational).

Nevertheless, to reach the goal an extended functional setting, different from gC, is required in which the variational symmetric tensor functions g(r) cannot generally be treated any more as metric tensors. As far as the functional setting is concerned, this implies that the covariant and countervariant components gμν and gμν are not required to satisfy the orthogonality condition (5), while a background metric-tensor representation is adopted. In other words, there must exist by construction a background tensor field, to be identified with a particular solution of EFE g^(r)g^μν(r)g^μν(r) (see Equation (5)), which generates the geometric properties of the spacetime, in particular raising and lowering tensor indexes and prescribing the Christoffel symbols (and therefore the Ricci tensor). The functional setting is now identified with the unconstraint setting gU defined as

gUg(r)g^(r)+δg(r)C2(Q4)gμν=g^μαg^νβgαβ(r),g^μνg^μk=δνk, (36)

where, in further difference with respect to gC, all tensor indices are now raised and lowered by g^(r).

The introduction of the setting (36) is required in order to satisfy the variational constraint condition. In fact, a necessary condition for the validity of the variational functional constraint condition (24) is to require that identically, that is, for arbitrary variations δg(r), the following two variational equations, that is, respectively

δRμνddαRμν(g^(r)+αδg(r))α=0=0, (37)

and

δRμνddαRμν(g^(r)+αδg(r))α=0=0, (38)

hold. In view of the definition of the Frechet derivative and ruling out possible constraint conditions on the variations δg(r), this means that the Ricci tensor, both in its covariant and countervariant components Rμν and Rμν, must necessarily be functions of g^(r) only. Therefore, in the unconstrained variational principle Rμν should behave as an extremal 4-tensor in both its covariant and countervariant representations, that is,

Rμν=Rμν(g^(r))R^μν, (39)
Rμν=Rμν(g^(r))R^μν, (40)

with the background tensor field g^(r) being identified with a particular solution of EFE (5), that is, the extremal solution g^(r). Such a peculiar kind of constraints will be therefore referred to as extremal constraint conditions. However, one further notices that this requires also that

R^μν=g^μα(r)g^νβ(r)Rαβ(g^(r)), (41)

which means that the background tensor g^(r) must be the metric tensor, that is, the tensor which raises and lowers tensor indices of all tensor fields. In fact, in order to preserve the 4-scalar property of the Lagrangian density (see Equation (3)), also the varied functions g(r) must behave accordingly (i.e., its indexes must be raised and lowered by g^(r)). This implies also that the orthogonality conditions (5) cannot generally be fulfilled because the simultaneous validity of Equations (39) and (40) is generally not permitted when the varied functions g(r) belong to (4). A fact which forces one to adopt the more general functional setting (36).

As a result, one concludes therefore that within the functional class gU necessarily Equations (37) and (38) are identically satisfied, without implying any constraint condition for the varied function g(r), nor on their variations δg(r). Then, in the same functional class, and based on the setting (39) and (40), the EH action integral (1) becomes

SEH(U)(g,g^)Q4d4rLvar(g,g^),Lvar(g,g^)=gLEH(g,g^)+LF(g,g^), (42)

where following Equation (3) and including explicitly the dependence in terms of the extremal tensor field g^(r), the integral SEH(U)(g,g^) and

LEH(g,g^)=gμνR^μν2Λ (43)

identify respectively the unconstrained EH variational functional and the unconstrained variational Lagrangian density. The EH variational principle (8) thus reduces to the unconstrained variational principle

δSEH(U)(g,g^)=ddαSEH(U)(g^(r)+αδg(r),g^(r))α=0=0, (44)

where the corresponding Euler-Lagrange equations become now

δSEH(U)(g,g^)δgμνLvar(g,g^)gμν=0, (45)

the partial derivative with respect to gμν being performed at constant g^g^μν. Straightforward algebra then delivers the expected result, showing that the unconstrained Euler-Lagrange equations determined in this way indeed coincide identically with EFE (i.e., Equation (5)).

The result is striking because it shows that the constrained EH variational principle reduces in this case to an unconstrained one. This occurs because both the varied tensor functions g(r) and their variations δg(r) remain completely unaffected by the constraints (39) and (40), that is, provided g(r) belongs to a suitably chosen functional class gU. The peculiarity arises because the same constraints are identically satisfied by themselves and do not affect the variational principle itself. It should be stressed that such a result has formal analogies with the Palatini variational principle because in both cases the Ricci tensor is regarded as independent of the variational tensor gμν. The resemblance, however, stops here because, unlike the Palatini approach, here the connections are not considered variational.

2.4. An Equivalence Theorem

The equivalence of the two variational principles given above can readily be shown, provided they are both couched in the same functional setting, that is, gU and of course provided the boundary constraint condition (24) is also set (this is certainly not a restrictive condition considering that variational functions g(r) belonging to gU also obviously belong to gC). In this regard an important preliminary remark must be made. This concerns the fact that the explicit variational contribution δSEHexpl when evaluated with respect to the functional class g(r)U takes the same representation given by Equation (4) (the reason being that the identity (19) still holds). This makes possible to evaluate in the same functional class gU the difference functional

ΔSEH(g,g^)SEH(g)SEH(U)(g,g^)=Q4d4rΔLvar(g,g^), (46)
ΔLvar(g,g^)Lvar(g)Lvar(g,g^). (47)

Thus, in view of Equations (4) and (44) it follows that the variational principle δΔSEH(g,g^)=0 yields identically:

Q4d4rδΔLvar(g,g^)=0. (48)

This means therefore that when setting g(r)=g^(r) the previous equation must be satisfied identically. As a consequence, ΔLvar(g,g^) represents effectively a gauge contribution to the variational Lagrangian density.

The implication is therefore that the boundary-constrained EH variational principle Equation (8) and the unconstrained one (44) are completely equivalent, both yielding the same form of EFE. This means also that the constraint conditions (37) and (38) should be interpreted as natural constraint conditions, in the sense that they are necessarily satisfied in the framework of EH theory. In other words, it is always possible to get rid altogether of the variations δRμν which enter the Euler-Lagrange equations. This provides effectively a new perspective in EH theory, since the property holds for arbitrary coordinates systems and is therefore frame-independent.

2.5. Covariance and the Issue of Missing Gauge Invariance

To complete the analysis, the property of covariance and the issue of (missing) gauge invariance of the EH theory is briefly addressed here. First, the manifest covariance and covariance properties of the EH variational principle, Lagrangian density and of the corresponding EFE, are readily established. These properties follow from two basic ingredients:

(1) The 4-tensor nature of the EH action functional, either in its constrained or unconstrained forms, that is, respectively given by Equations (8) and (44).

(2) The fact that variations are performed with respect to a tensor field, that is, gμν.

Thanks to the 4-scalar nature of the (invariant) volume element dΩ (see Equation (2)) and similarly thanks to the assumption that the Langrangian densities are by construction 4-scalars, either if represented in terms of their constrained or unconstrained forms, namely LEH(g)+LF(g) and LEH(g,g^)+LF(g,g^) respectively, implies that necessarily the EH variational principle (i.e., the action integral) and EFE are consistent with the principle of manifest covariance. As an implication, the covariant nature of the theory is established at once. In fact, given an arbitrary local point transformation, that is, an arbitrary diffeomorphism

rr=r(r) (49)

which maps the spacetime Q4,g^(r) in itself, both the EH variational principle and the corresponding EFE (both written equivalently either in unconstrained or constrained forms) are covariant.

A different consideration however holds for the Lagrangian density, which represents the effective quantity that generates the EL equations after taking variations and which, for this reason, should encode the physical properties of the field. In the case of EH variational principle the variational Lagrangian density Lvar(g) defined by Equation (16) is not a 4-scalar, since the contribution of the determinant g is not either. This feature is unusual in field theory, because it means that the resulting EL equations, namely EFE, are determined partly by a contribution carried by the Lagrangian function LEH(g) and partly by a non-tensor contribution which defines the differential volume element of the action integral and which is independent of the actual realization of the same Lagrangian function. This is not a marginal aspect, but rather implies serious consequences on the theoretical framework of variational theory for GR. In particular:

(1) Either the constrained or unconstrained symbolic Euler-Lagrange equations, namely (17) and (45), are not set in manifest-covariant form. This feature notwithstanding, however, upon evaluating explicitly the functional derivatives in the two cases one immediately realizes that the corresponding (and equivalent) EFE represented by Equation (5) are manifestly covariant too, that is, are set in 4-tensor form.

(2) The EH asynchronous variational formulation lacks a basic gauge invariance property which should however characterize standard variational theories of continuum classical fields. We refer here to the gauge-invariance property provided by the trivial gauge transformation acting on the variational field Lagrangian L in terms of an arbitrary constant 4-scalar C=const. by means of the transformation

LL+C. (50)

In the standard theory of variational principles the addition of a constant in the Lagrangian must leave invariant the extremal equations. In contrast, in the context of asynchronous EH principle it follows that, when Equation (50) applies, any Lagrangian density L of the type L=gL transforms necessarily as

LL+gC. (51)

Therefore, the introduction of the additive constant C, instead of leaving invariant the Euler-Lagrange equations, changes the functional form of EFE, thus generating a non-vanishing contribution in the same extremal equations which is equal to 12Cgμν. However, the ensuing lack of such gauge-invariance property - which incidentally affects both the EH and ADM theories—is not simply a mere unfortunate inconvenience. In fact it violates a fundamental requirement usually regarded as a cornerstone of continuum classical and quantum field theory. We stress once more that this feature is ultimately related to the adoption of a non-tensorial variational Lagrangian density, which depends on g associated with the 4-volume differential element, and gives rise to a continuum field theory which is intrinsically non-gauge invariant. However, the property of gauge invariance should be regarded as a mandatory feature of variational field theories in general, to be fulfilled both by variational and extremal continuum fields. As a consequence, also the variational functional and the corresponding variational Lagrangian, together with the corresponding extremal quantities, should be necessarily determined up to a suitable gauge contribution. The violation of this property in the asynchronous EH variational theory is therefore a conceptual and theoretical obstacle to the realization of a truly manifestly-covariant variational formulation of EFE.

3. The ADM Variational Approach

Let us now briefly recall the well-known ADM theory, which in mainstream literature is commonly credited for realizing a viable Hamiltonian formulation for GR. Formulated in 1959 [3,4] and heavily inspired by previous work by Dirac [19,20], it partly relies on some of his mathematical formalism, especially Dirac’s customary identification of canonical momenta in terms of coordinate-time partial derivatives and Dirac’s theory of constrained dynamics [21].

For the purposes of the present investigation (and the economy of presentation) it is sufficient to consider here only the so-called “Hamiltonian realization” of ADM theory. The variational approach, whose starting point is based on a particular coordinate representation of the EH Lagrangian variational principle, is based on the following steps:

  • The prescription of the variational functional, which is identified with the EH action functional (1).

  • The prescription of the functional class g(r)C in which the varied functions take the generic form g(r)gextr(r)+δg(r), where gextr(r) is identified with a suitable extremal tensor field.

  • The prescription of ADM variables in terms of the varied tensor function g(r) (and hence of the extremal tensor field gextr(r) too).

  • The representation of the EH Lagrangian density in ADM variables.

In detail, once having prescribed the variational functional, the second step is based on the replacement of the variational tensor gμν(r), required to belong to g(r)C, with a suitable set of non-4-tensor variational variables, identified with the set hab(r),N(r),Na(r) to be denoted as ADM Lagrangian variables, where hab is the 3×3 variational matrix (the Latin indices a,b run from 1 to 3, while the greek ones range as usual from 0 to 3), while N and Na are respectively the lapse function and the shift 3-vector. More precisely, the ADM variables are defined by suitably prescribing the set of coordinates (GR-frame) rrμ=ct,r and the components of the variational tensor gμν(r) as follows:

g00(r)=1c2(N2+NaNa),g0a(r)=ga0=1cNa,gab(r)=hab. (52)

Notice that a priori all components of g, as well of hab(r),N(r),Na(r) are considered functions of the 4-position r=rμ, that is, generally dependent on both time and spatial coordinates. Then, direct calculation shows that in whole generality the identity

gN(r)h(r) (53)

must hold.

The fundamental aspect to highlight here is that, in view of the outcomes of Section 2, there can exist actually two different admissible representations of the Lagrangian density in ADM variables, which are related to the two possible choices of the EH Lagrangian densities Lvar(gμν) and Lvar(gμν,g^μν) (see Equations (16) and (42) respectively) corresponding to the boundary-constrained and unconstrained Lagrangian EH variational principles stated above. In the following, in order to question the nature of the ADM Hamiltonian theory for EFE, we shall therefore consider separately the two settings.

3.1. The Constrained ADM Variational Formulation

Let us consider first the relationship in terms of Lvar(gμν) originally established by Arnowitt, Deser and Misner (ADM theory [4]). Such an approach will be hereon referred to as constrained ADM variational formulation. The task involves suitably representing in terms of ADM variables both SEH(g) and the corresponding variational Lagrangian density Lvar (see Equation (16) above), assuming the functional setting g(r)C. To this end the 4-dimensional integral is split as a time integral (by setting also c=1) and a surface integral on the (constant-time) 3-surface section Σt, where t is the coordinate-time, by representing d4r as d4r=dtdΣ, with dΣ being the surface element on Σt. For simplicity, we consider here the vacuum case with non-vanishing cosmological constant (Λ0).

Starting point is the prescription of the ADM variational functional, namely

SADM(G)=dtΣtdΣLADM(G), (54)

and the related ADM variational Lagrangian density LADM(G), which are identified according to

SADM(G)SEH(g),LADM(G)=Lvar(g). (55)

Here, G(r) denotes the set of ADM canonical variables G(r)hab,pab,N,Na, with hab,N,Na being the Lagrangian variables determined by inversion of Equation (52). Then, if it is assumed that g(r) belongs to the functional setting gC (defined by Equation (4)), this means that the corresponding functional setting G(r) should be prescribed accordingly.

Notice that here the ADM variational Lagrangian density LADM(G) must be identified with the EH Lagrangian density Lvar(gμν) (see Equation (16)). This can be shown to require

LADM(G)pabh·abNH(G)NaHa(G), (56)

where the notations are standard. Thus, in particular, pab are suitably-defined “canonical momenta” conjugate to hab, h·ab is the Lie derivative with respect to time of hab, which according to the choice of ADM variables in such a reference frame coincides with the partial time derivative h·abthab, while H and Ha are the so-called “Hamiltonians”

H=h(3)R2Λ+h1pabpab12h1p2, (57)
Ha=2hDbh1/2pab, (58)

where p=paa and Dc identifies the covariant derivative prescribed on the section Σt, which when acting on pba is defined as Dcpba=hdahbehcffped. Furthermore, it is important to notice that (3)R represents the spatial curvature scalar and is defined as (3)R=hlm(3)Rlm, where (3)Rlm is the 3-dimensional Ricci tensor, which is defined in terms of the corresponding 3-dimensional Riemann tensor as (3)Rlm=(3)Rlkmk. As such, (3)Rlm still depends on first and second-order partial spatial derivatives of the Lagrangian ADM fields, namely it is generally of the form

(3)Rlm=(3)Rlm(hab,ihab,ijhab). (59)

More precisely, it is defined as

(3)Rlm=(3)Rlkmk=kγlmkmγklk+γknkγlmnγmnkγkln, (60)

with γjki denoting the 3-dimensional Christoffel symbols

γjki=12hiljhkl+khjllhjk. (61)

Finally, let us introduce the parametrized varied functions

Gextr+αδGhab(r,α),pab(r,α),N(r,α),Na(r,α), (62)

where α is a real parameter α1,1 and

hab(r,α)=habextr+αδhab,pab(r,α)=pabextr+αδpab,N(r,α)=Nextr+αδN,Na(r,α)=Naextr+αδNa, (63)

where δhabδgab, while Gextrhabextr,pabextr,Nextr,Naextr denotes the extremal solution (see below) which extremizes the variational functional. Thus, the ADM variational principle in terms of the Frechet derivative becomes:

δSADM(G)ddαSADM(Gextr+αδG)α=0=0, (64)

which must hold for arbitrary (linearly) independent variations δhab, δpab, δN and δNa defined according to the functional setting. About the notations, we stress that the extremal tensor fields habextr, pabextr, Nextr and Naextr are considered solutions of the corresponding Euler-Lagrange equations (see below Equation (65)). The same fields, based on Equation (52) determine also the corresponding extremal tensor field gextr(r) which in principle needs not to coincide with g^(r).

The formal derivation of the Euler-Lagrange equations determined by (64) is reported in detail by several authors (see for example Wald [7]). According to these references this yields for the extremal ADM fields G=Gextr the set of ADM initial-value equations which is realized respectively by

h·abNpabHNcpabHc=0,p·abNhabHNchabHc=0,H=0,Ha=0, (65)

and by the initial conditions

(hab(to),pab(to)), (66)

with to denoting a prescribed (but arbitrary) initial coordinate time. Thus, in particular, the first two equations in (65) represent two Hamilton-like evolution equations for the conjugate variables (hab,pab), formality representing Lagrangian generalized coordinates and conjugate momenta. The remaining two equations are instead respectively a 3-scalar and a 3-vector constraint equations expressed in terms of so-called “Hamiltonian” functions H and Ha, which are required to vanish identically for the extremal values of the ADM variables (hab,pab,N,Na). The basic implication which follows is therefore that, from the ADM variational principle (64), the extremal ADM variables Gextr, in particular (habextr,pabextr), are not independent being subject to the previous constraint equations (last two equations in Equation (65)).

We notice that in the literature (see [4,7]) it is claimed that in the case of vacuum the same ADM initial value equations “are equivalent to the vacuum Einstein equation”. However, it is important to recognize that the initial conditions (66) are in principle free. On the other hand, one can show that the Lagrange multipliers N,Na are uniquely determined by requiring that initially (at to) they satisfy the said constraint equations [4]. As a consequence, as shown below, the solution of the ADM Euler-Lagrange Equations (65) and (66) is necessarily non unique. The obvious but nevertheless still nontrivial consequence is that the same system of equation does not necessarily coincide identically with EFE, unless a particular choice of initial conditions (66), to be therefore appropriately determined, is actually selected.

The other important remark concerns the customary literature identification of the ADM initial-value problem (65) and (66) with a Hamiltonian system in the proper sense, to which therefore the customary formalism of Hamilton and Hamilton-Jacobi theories should apply. For such an identification to be possible, however, the same ADM initial-value problem should identify a set of first order, evolution ODEs with respect to the ADM variablesGhab,pab,N,Na.

However, such a statement is manifestly incorrect. The reason is as follows. According to the standard notion of Hamiltonian system, the Hamilton equations should depend only on the generalized coordinates and the conjugate momenta, and not contain additional (partial) derivatives of the coordinates. Nevertheless, the ADM Euler-Lagrange equations, just as the original EFE (recalled above by Equation (5)), still contain second-order partial derivatives acting on the Lagrangian ADM variables hab. The crucial term which carries such a kind of contributions is the spatial curvature (3)Rab (in EFE it is the Ricci tensor), which in fact as shown by Equation (59) (see also definition given by Equation (60)) depends on first and second order partial spatial derivatives of hab.

Thus, first one notices that in the ADM Euler-Lagrange Equation (65) (3)Rij depends on hab too, so that

hij(3)R(3)Rij, (67)

contrary to what assumed in the standard ADM approach. Furthermore, there are two equations, namely respectively the second and the third ones, which identify PDEs instead of ordinary differential equations as required for their identification in terms of a Hamiltonian system.

Incidentally, however, there is a further problematic (and dubious) aspect of the ADM approach to be mentioned.

This concerns the issue of the variational treatment of the same 3-tensor represented either in covariant or countervariant representations, that is, (3)Rlm and (3)Rlm. In fact, as indicated above in Equation (59), both representations of the same 3-tensor contains explicit dependencies in terms of first and second-order partial derivatives of hab with respect to the spatial coordinates. In fact, for consistency with the functional setting established above, the variation under the integral (54) should be again performed in terms of the Frechet derivative. This includes the variation of the 3-tensor (3)Rlm (or (3)Rlm). Notice in fact that the same 3-tensor cannot be regarded as constant when it is considered under the integral (see also related discussion in case of Choice A in Section 2.2 above). Furthermore, in view of the previous definition (see Equation (60)), the functional variation of δ(3)Rlm (similarly for (3)Rlm) necessarily takes the form:

δ(3)Rlm=(3)Rlm(δgab,ihabextr,ijhabextr)+(3)Rlm(habextr,iδgab,ijhabextr)+(3)Rlm(habextr,ihabextr,ijδgab), (68)

which therefore depends, besides δgab, also on the first and second-order (spatial) partial derivatives iδgab and ijδgab. On the other hand, for the validity of the variational principle (64) the dependences in terms of the first and second partial derivatives should not appear. Instead, for the validity of the Euler-Lagrange equations themselves reported above (i.e., Equation (65)) all contributions should actually be considered ignorable, namely requiring instead

δ(3)Rab=0, (69)
δ(3)Rab=0. (70)

In both cases this implies that they should satisfy extremal constraint conditions of the type

(3)Rab=(3)Rab(habextr,ihabextr,ijhabextr), (71)
(3)Rab=(3)Rab(habextr,ihabextr,ijhabextr), (72)

with habextr denoting the extremal tensor field hab(r). Nevertheless, since Rab and Rab are related by assumption via the 3-tensor hab(r) (hab(r)) which lowers (raises) their tensor indices, it is obvious that a requirement cannot be consistent with the functional setting of the ADM approach indicated above (4) (see also the analogous discussion in Section 2.3). The alternative, that is, which occurs when these dependencies are retained, has the so-to-sayunpleasant” side-effect of simply destroying the validity of the ADM variational principle (64). Indeed, δ(3)Rab gives rise in such a case (through the Frechet derivative δ(3)Rab) to the appearance of first and second-order partial derivatives of the variations δhab.

In conclusion, the suggested implication is that the ADM initial value equations are formally correct only provided the variational contribution of δ(3)Rab and δ(3)Rab are both set to zero (which amounts to introduce a hidden constraint), while—in addition—the contribution of the extremal term (3)Rab((hab)extr) gives anyway rise (as in the case of EFE) to the appearance of second-order partial derivatives of (hab)extr, thus yielding an intrinsically non-Hamiltonian PDE.

We stress that such features cannot be corrected by adopting a perturbation theory, that is, in which the spatial curvature is treated to leading order as constant. In fact perturbative schemes of this type do not converge everywhere (for example near singularities) and therefore do not appear reliable for the establishment of a quantum theory of gravitation based on the ADM formulation of GR.

3.2. An Unconstrained Lagrangian Variational Formulation for the ADM Representation

As anticipated, we now consider the relationship in terms of the unconstrained Lagrangian EH variational principle which is characterized by the corresponding Lagrangian density Lvar(gμν) (see Equation (42)). As we intend to show this provides an alternate route for the prescription of the ADM variables and at the same time a new possible realization of ADM theory.

This is based on the unconstrained formulation of EH theory pointed out above (see Section 2.2) and on the construction of the variational functional SEH(g,g^) (see Equation (42)). The new approach consists now in representing SEH(g,g^) and the corresponding variational Lagrangian density Lvar(g,g^) in terms of ADM variables, thus prescribing the new ADM functional and Lagrangian density according to

SADM(G,g^)SEH(g,g^),LADM(G,g^)Lvar(g,g^). (73)

Here the action functional and variational Lagrangian density (respectively SEH(g,g^) and Lvar(g,g^)) are now both defined with respect to the functional class grU. This feature permits us to fulfill the variational constraint (24), according to which the Ricci tensor is considered extremal, that is, it is set equal to R^μνRμν(g^(r)). Thus, upon invoking the unconstrained EH variational principle (44), this implies identically

δSADM(G,g^)=0. (74)

For clarity and to distinguish it from the standard ADM variational principle indicated above (i.e., Equation (64)), this will be referred to here as the unconstrained ADM Lagrangian variational principle. If one considers the case of vacuum endowed with a non-vanishing cosmological constant, then it follows in particular

LADM(G,g^)=NhLEH(g,g^), (75)

where by construction gμνR^μν=gμνR^μν, so that LEH(g,g^) is given by Equation (43). Hence, in terms of the ADM representation (52) it follows:

gμνR^μν=1c2(N2+NaNa)R^00+2cNiR^0i+habR^ab. (76)

To determine the corresponding Euler-Lagrange equations we stress that, unlike the variational principle (64), the calculations must now be performed with respect to the functional class g(r)U so that they exactly coincide with EFE (i.e., Equation (5) upon setting Tμν=0), thus yielding

Rμν12RΛgμν=0, (77)

where by construction Rμν=R^μν, R=R^ and gμν=g^μν. The required Euler-Lagrange equations for the extremal ADM fields are then simply:

R^0012R^Λ1c2(N2+NaNa)=0,R^0a12RΛ1cNa=0,R^ab12R^Λhab=0, (78)

which therefore depend only on the Lagrangian ADM fields D(hab,N,Na). Here it is understood that the representation of the extremal Ricci tensor must be expressed as well in terms of ADM variables through the customary definition of Christoffel symbols in terms of the metric tensor. We notice here that:

  • The Euler-Lagrange Equation (78) coincide with EFE (see Equation (5)) when expressed in terms of the Lagrangian ADM fields D(hab,N,Na).

  • The extremal solution Dextr(hab,N,Na)extr coincides necessarily with Dextr(h^ab,N^,N^a), where h^ab,N^ and N^a denote the ADM fields expressed in terms of g^(r), that is, the background metric tensor and the solution of EFE. Therefore, by inversion of Equation (52), their representation in terms of g^(r) is necessarily provided by
    N^(r)=cg^0a(r)g^0a(r)g^00(r),N^a(r)=cg^0a(r)=cg^a0(r),h^ab(r)=g^ab(r). (79)
  • Since the extremal Ricci tensor R^μν depends on second-order partial derivatives of g^(r), Equation (78) must be intended again, in analogy to EFE, as implicit PDEs for the extremal Lagrangian ADM fields Dextr(h^ab,N^,N^a).

Let us now address the issue of the relationship between EFE and the same ADM Euler-Lagrange equations. This is done by requiring the simultaneous validity of the two ADM variational approaches indicated above and realized respectively according to the constrained and unconstrained Lagrangian variational principles. First, we notice that Equation (79) necessarily provides a particular solution of Equations (65) and (66) in which, however, pab remains apparently undetermined. However, one can show that the second equation in (65) yields a differential equation for pab subject to a suitable initial condition for pab(to). In particular, in the case g^(r) is stationary, by setting pab(to)=0 it follows that pab remains everywhere identically zero and therefore is uniquely determined too.

On the contrary, if different initial conditions (66) are selected then it is obvious that generally the solution of the ADM initial value Equations (65) and (66) may not coincide with g^(r). The proof is immediate. In fact this manifestly occurs if, in contrast with the third equation in (78), the initial condition for hab(t,x), namely hab(to,x), is chosen to be

hab(to,x)g^ab(to,x). (80)

As a consequence it is concluded that the same ADM initial value equations and EFE (see Equation (5)) do not generally coincide. This conclusion is important for two main reasons. The first one is that it rules out a possible misunderstanding (see also Wald’s citation in the previous section) according to which vacuum EFE and the ADM initial value equations should coincide always, that is, independent of the initial conditions. The second one is because it permits us to address the problem of the role of the variational constraints (i.e., Equations (6), (39) and (40)) in ADM theory. In fact, the previous conclusion shows that the same ADM initial value equations do not generally satisfy the variational constraints. However, this happens if proper initial conditions are chosen for the ADM fields.

Finally, despite being covariant with respect to the group of local point transformations (LPT), it should be added that ADM theory is not manifestly-covariant with respect to the same group. As a consequence, it is not objective, so that the relevant equations, and in particular the coordinate time, are generally not preserved when introducing a generic LPT. In other words, the ADM variational variables, either Hamiltonian or Lagrangian ones, do not have a tensorial character. This is similar to what occurs in the case of the Palatini variational approach to GR, where variational variables include the connections, which are not 4-tensors. Nevertheless, in difference with the Palatini approach where the variations of the connections are by themselves 4-tensors together with the resulting Euler-Lagrange equations, the variations of ADM variables and the corresponding Euler-Lagrange equations (i.e., respectively either (65) or (78)) do not have a tensorial character either.

4. Synchronous and Manifestly-Covariant Hamiltonian Theory of GR

Given validity of the outcomes pointed out in previous sections, here an alternative to the non-manifestly covariant, non-gauge invariant and non-Hamiltonian ADM theory is provided. In this regard, it is appropriate to recall what is actually its starting point. This is given by the common belief that is epitomized for example by the original sentence due to Wald [7] according to which: “... a Hamiltonian formulation of a field theory requires a breakup of spacetime into space and time. Indeed the first step...consists of choosing a time function t and a vector field tα (the “flow of time”) on a spacetime such that the (space) surfaces Σt of constant t are spacelike Cauchy surfaces (and) such that tαα1=1.” The underlying philosophy of the whole ADM approach and the same sentence are, however, fundamentally incorrect. First, because in field theory the notion of manifest covariance, and more precisely that of manifestly-covariant Hamiltonian approach, are already well known, being due to deDonder and Weyl [22,23,24,25,26,27,28]. Second because, already in the context of classical and quantum mechanics, the adoption of proper time in place of the coordinate time t is an obvious matter-of-fact method which affords the realization of manifesly-covariant representations of relativistic Hamiltonian systems.

However, manifest covariance requires the prescription of a background spacetime viewpoint by means of a suitable functional setting on the class of varied functions g(r). In fact, in order to establish the very notion of manifest covariance it is obviously necessary to specify in advance the spacetime with respect to which the same property holds. Thus, identifying such a spacetime in terms of a differential manifold Q4,g^(r), the task involves to suitably prescribe the corresponding background metric tensor g^(r) with respect to some prescribed GR-frame rrμ. For definiteness, here the spacetime will be intended as a 4-dimensional time-oriented Riemannian differential manifold. Accordingly, g^(r) determines the geometric properties of spacetime so that, besides raising and lowering tensor indices, it must prescribe also the covariant derivatives and the Christoffel symbols. Such a setting nevertheless does not come as a surprise since it emerges naturally already in the context of the Lagrangian EH theory (see previous discussion in Section 2.3 and Section 3.2).

4.1. The Proper-Time Parametrization of Space-Time

We start defining the notion of proper time in the context of GR and its physical interpretations. Its geometric definition follows by introducing a mapping between a subset of the real axis IR and the same background spacetime Q4,g^(r) of the form

r=r(s), (81)

being r an arbitrary 4-position (“point”) of Q4,g^(r) and sIR a suitable 4-scalar, to be denote here as “proper time”. Therefore, the geometric meaning of s depends on its precise prescription. This is obtained by means of two definitions:

(A) Proper time s

First, identifying s with the arc length ds2=g^μνdrμdrν measured along an appropriate finite-length geodetic defined with respect to the background space-time Q4,g^(r), that is,

C(ro,r1)=rr=r(s),ro=r(so),r1=r(s1),sso,s1, roΣ03,r1Σ13, (82)

Σ03 and Σ13 being two suitable subsets of Q4 (see below). Denoting drμds=tμ(r) the tangent to the curve at r and μ the covariant derivative evaluated with respect to the background spacetime Q4,g^(r), then by construction along the same curve if follows identically that μtμ(r)=0.

(B) Family of geodeticsC(ro,r1)

The family C(ro,r1) is defined in such a way that:

(1) For fixed proper times so and s1 (with so<s1), each curve C(ro,r1)C(ro,r1) has the extrema ro=r(so),r1=r(s1) crossed by the same curve respectively at proper times so and s1. In addition, by assumption: (a) all curves C(ro,r1) belong to the same connected subset Q14 of spacetime Q4,g^(r) which has everywhere the same signature; (b) the lower and upper extrema ro=r(so) and r1=r(s1) belong to two smooth hypersurfaces Σ03 (“lower” boundary) and Σ13 (“upper” boundary) of the subset of Q14. Hence, Q14 is the subset of Q4 having lower and upper boundaries Σ03 and Σ13 where the extrema of all curves C(ro,r1) lie.

(2) For each point r of the said subset of Q14 there is a unique curve C(ro,r1)C(ro,r1) which belongs to it.

(3) Two geodetics of the family never cross each other.

(4) All geodetic curves C(ro,r1) are prescribed in such away to be mono-energetic, that is, so that, in the frame in which ro=r(so) the spacetime is locally flat, the corresponding tangent 4-vectors

tμ(ro)drμdsro (83)

have all the same zero component of the 4-velocity.

The proper-time parametrization (s-parametrization) of spacetime is then realized by parametrizing the background metric tensor g^(r)=g^μν(r) and the canonical tensor fields of the Hamiltonian formulation (see below) xR=gμν,πμν, and in particular the variational tensor field g=gμν, in terms of the proper time s. Thus, at the classical level, the s-parametrization of g^(r) is obtained by means of the representation

g^(r)g^(r(s))g^μν(r(s)). (84)

Instead, the corresponding parametrization of the variational tensor field g=gμν and its conjugate canonical momentum π=πμν is obtained letting

g  g(s)g(r(s),s), (85)
π  π(s)π(r(s),s). (86)

Finally, regarding the physical interpretation, this amounts to assume that the Hamiltonian structure of GR (to be suitably identified) corresponds to disturbances (or signals) of the background metric tensor which propagate in the space-time and occur with finite, that is, subluminal, speed of propagation measured by the proper-time s. Such an interpretation will become, nevertheless, obvious in the quantum formulation where such “signals” are interpreted in terms of massive gravitons (see Ref. [29]).

4.2. The Manifestly-Covariant Hamiltonian Structure of GR

In this section the manifestly-covariant Hamiltonian structure of GR, that is, associated to EFE and developed in Refs. [8,9,10], is recalled. It realizes in the case of the gravitational field the so-called deDonder-Weyl manifestly-covariant canonical representation for continuum fields [22,23,24,25,26,27,28]. To begin with, we recall some of its peculiar characteristics. The first one is the adoption of independent and symmetric Lagrangian variables ggμνgνμ associated with the physical properties of the gravitational field. Notice that the tensor g is to be distinguished from the background metric tensor g^g^μν which determines instead the geometric properties of the space-time and raises/lowers tensor indices [10]. Hereon, for definiteness g^ identifies an in principle arbitrary particular solution of the Einstein field equations, with g being an in principle arbitrary and independent real symmetric tensor. As such g is assumed to belong by assumption to the functional class gU defined above (see Equation (36)) and is therefore not required to satisfy orthogonality conditions of the type (5).

Then, according to Ref. [10] the classical Hamiltonian structure of GR is represented by the set xR,HR, formed by the s-parametrized canonical state xR(s)(g(s),π(s)) and a suitable classical Hamiltonian density HR. We remark that here the tensor properties of the canonical fields g(s)=gμν(s),π(s)=πμν(s) and the Hamiltonian density are prescribed in terms of the background metric tensor g^(r(s)), the first ones being identified with real second order tensor fields and the second one as a real 4-scalar. Then, introducing the Poisson bracket A,B=AgμνBπμνAπμνBgμν, by construction the canonical state xR(s) fulfills a corresponding set of continuum Hamilton equations

dgμνds=gμν,HR=HRπμν,dπμνds=πμν,HR=HRgμν, (87)

which satisfies the initial-value condition

xR(so)(gμν(so),πμν(so)), (88)

where gμν(so) and πμν(so) denote two initial conjugate tensor fields and so is the initial proper-time. Hereon a solution of the initial value problem (87) and (88) will be identified with the symbol x¯R(s) and referred to as “extremant” of the Hamiltonian structurexR,HR. As a consequence, the same equations can be viewed as canonical evolution equations for xR(s). As such, provided the Hamiltonian density HR is sufficiently regular (see below), the same equations determine uniquely the proper-time evolved canonical state xR(s) in terms of an (in principle arbitrary) initial condition of the type (88).

Here the notations are standard according to Ref. [30] (see also Refs. [10,29]). Thus, while s is—as stated above—the proper-time along an arbitrary field geodetics C(ro,r1) which belongs to r=r(s) at proper-time s, the differential operator

dds=ddss+ddsr (89)

identifies the covariant s-derivative operator. In particular, ddsstα^α is the directional covariant derivative, with tα=drα(s)ds being the local tangent to the field geodetics, ^α the covariant derivative defined with respect to the background metric tensor g^(s) and ddsr denoting the corresponding covariant s-partial derivative prescribed according to the same reference. Furthermore, the Hamiltonian density HR is identified with the function

HRTR(π,g^)+V, (90)
V(g,g^,r(s),s)Vo+VF, (91)

where TR(π,g^) and V(g,g^,r(s),s) denote the effective kinetic and the normalized effective potential densities, with Vo and VF being respectively the vacuum and non-vacuum (or external) contributions. Thus, in detail, the first one is by construction a function of the canonical momenta only and takes the form

TR(π,g^)12αLπμνπμν, (92)

with α and L being suitably-prescribed dimensional constant 4-scalars identified according to the treatment given in Ref. [29]. The prescription of the vacuum potential Vo follows from analogous one obtained in the case of the asynchronous unconstrained Lagrangian variational principle (44). This implies for the Ricci tensor to be considered extremal, that is, of the form (39) and (40) respectively for its covariant and counter-variant components. The corresponding representation for Vo is therefore given by

Vo(g,g^,s)hαLgμνR^μν2Λ. (93)

Here, Λ identifies the cosmological constant and h(g,g^) is a suitable 4-scalar variational weight-factor defined as

h=214gαβgμνg^αμg^βν

and finally all hatted quantities, and in particular the Ricci tensor R^μν, are intended as evaluated in terms of the background field tensor g^. Thus, explicit evaluation on the rhs of Equation (87) yields

dgμνds=πμναL,dπμνds=Vgμν, (94)

where the first equation prescribes the canonical momentum πμν in terms of the “generalized velocity” dgμνds, while the second one provides a dynamical equation for the canonical momentum. The connection with EFE follows in straightforward way by setting the initial condition

xRso=gμν(so)g^μνso,πμν(so)π^μνso=0, (95)

with g^μνso denoting the background metric tensor evaluated at so. In fact, the initial condition x¯R(so)xRso must necessarily be such that

V(g,r(s),s)gμνg=g^,s=so=0, (96)

with π^μνso=0 being the corresponding initial null canonical momentum. Then, for all s>so, xRs=gμν(s)g^μνs,πμν(s)π^μνs0, with g^μνs being for all (r(s),s) solution of the stationary equation:

V(g,r(s),s)gμνg=g^=0. (97)

Indeed, the initial conditions (95) together with Equation (96) imply that dπμνdss=so=0.

The proof is straightforward. First, we notice that Equation (96) necessarily follows by setting the initial condition (95). In fact, the same initial condition g(so)g^so requires that g(so) must be a solution of EFE (see, e.g., Equation (5)), while on the other hand elementary algebra shows that Equation (96) necessarily coincides with

R^μν(so)12R^(so)g^μν(so)+Λg^μν(so)=8πGc4T^μν(so), (98)

namely EFE evaluated at r(so)=ro along the geodesic trajectory C(ro,r1). As a consequence, it follows by construction also that dπμνdsso=0. Hence, this means that πμν(s) is stationary and null also for s>so, while gμν(s) remains stationary too, being therefore determined by Equation (97) which manifestly coincides with

R^μν(s)12R^(s)g^μν(s)+Λg^μν(s)=8πGc4T^μν(s), (99)

that is, nothing else than EFE evaluated at the proper time s, and again along the same geodesic trajectory C(ro,r1).

As a consequence, it is concluded that the canonical Equation (87) indeed recover EFE upon setting the appropriate initial conditions (95). More precisely, comparison with the ADM approach to the Hamiltonian theory of GR permits to establish the following features:

  • The evolution Equations (87) and (88) are uniquely associated to EFE. In fact, the same equations reduce to EFE by prescribing appropriate initial conditions in which the initial canonical momentum vanishes. For the initial condition (95) such a solution is unique. In addition the same equations are truly Hamiltonian in character. In fact the evolution Equations (87) and (88) define a true Hamiltonian system because, unlike the ADM approach, no unwanted hidden non-local dependences (which actually would destroy the same Hamiltonian structure) appear in the Hamiltonian density. Basic consequence is therefore that the set xR,HR can be identified as a possible candidate for the Hamiltonian structure of GR.

  • The Hamiltonian system xR,HR is constraint-free because, in contrast to the ADM approach, the tensor components of the Hamiltonian state xR(s)(g(s),π(s)) are independent. In fact, the requirement which permits to recover EFE is merely the natural dynamical constraint which arises due to the initial conditions. In addition, no boundary constraint conditions are set in such a framework. The same Hamilton structure, unlike in the ADM approach, is preserved also in validity of the initial condition (95) which actually permits to recover EFE.

  • Finally, the most important feature which emerges in a perspicuous way by inspection of Equations (87) and (88) is that the Hamiltonian system xR,HR is manifestly covariant, that is, it is set in 4-tensor form, since both xR and HR have a 4-tensor and therefore frame-independent character with respect to the background spacetime Q4,g^(r).

4.3. Synchronous Path-Integral Hamiltonian Variational Principle

Let us now address the issue of casting the continuous Hamilton Equation (94) in variational form, that is, in terms of a suitable realization of the corresponding variational principle. For its close similarity with the standard theory of Hamiltonian dynamical systems, an obvious answer to this question is provided by what can be referred to as a Hamiltonian variational principle. Let us introduce for this purpose the path-integral Hamiltonian functional (see also Ref. [31])

J(xR(s))=sos1dsLRgs,dgsds,s, (100)

where

LRgs,dgsds,s=πμνsdgμνsdsHRxRs,s, (101)

and dds is the covariant derivative (89) acting on a tensor function of the form fμν(r(s),s), which is parametrized as dds=tα(s)^α+s. Furthermore, here LRgs,dgsds,s and HRxRs,s are respectively the Lagrangian and the Legendre-conjugate Hamiltonian densities, whereas the canonical state xR, represented in terms of its s-parametrization xRs=gμν(s),πμν(s), belongs by assumption to the synchronous functional class

xRsS=xRxR(s)=x¯Rs+δxRs,xR(si)=xRi,xRiΣi3,i=0,1. (102)

Here x¯Rs denotes the general solution of the initial value problem (87) and (88), that is, a so-called extremant of the Hamiltonian structurexR,HR, while the boundary conditions xRs1 is not considered independent of xRso. Finally, here δxRs denotes the synchronous variation, that is, an arbitrary real tensor function δxRsδgμν(s),δπμν(s) such that

δxRsi=0 (103)

for s=so and s1, while

δs=0 (104)

for all sso,s1. The connection with the canonical Equation (87) can then be established at once, as it follows from the Hamiltonian variational principle

δJ(xR(s))=0, (105)

where the variation operator δ denotes again the Frechet derivative and the variations δxR(s) are considered arbitrary and independent, namely

δJ(xR)ddαJ(x¯R(s)+αδxR(s))α=0. (106)

For the precise evaluation of (105) one notices that by construction the general solution x¯R(s) is independent of the parameter α. Hence, its synchronous variation necessarily vanishes identically, so that δx¯R(s)ddαx¯R(s)α=00. As a consequence, also for the particular solution represented by the background metric tensor g^s the corresponding synchronous variation is vanishing identically, that is, δg^s0. The establishment of the corresponding Euler-Lagrange equations follows in terms of the functional derivatives of J(xR(s)):

δJ(xR(s))δπμν(s)=0,δJ(xR(s))δgμν(s)=0. (107)

For this purpose one notices that the following differential identity applies

πμνsdgμνsds=ddsπμνsgμνsgμνsdπμνsds, (108)

which implies that

sos1dsddsπμνsgμνs=πμνsgμνss1s1 (109)

is a gauge contribution (to the variational functional J(xR)) which, by construction (i.e., thanks to the prescription of the synchronous functional class xRsS), does not contribute. As a result, the corresponding Euler-Lagrange equations follow at once, being identical to the Hamilton equations indicated above (see Equation (87)) and with solutions subject to the boundary conditions xR(si)=xRi (for i=0,1). The latter ones can then be shown to be necessarily equivalent to (88). However, from direct inspection of the path-integral Hamiltonian variational principle (105), a number of consequences follow which are unprecedented, especially in connection with the ADM theory. More precisely these are represented by the following additional notable features:

  • The path-integral Hamiltonian variational principle is set in a manifestly-covariant form.

  • The path integral in the functional J(xR) is performed along a generic finite-length field geodetics C(ro,r1) so that xRs is parametrized in terms of it, namely letting xRsxRr(s),s while the Hamiltonian density is of the general form HRHRxRs,r(s),s.

  • The tensor components of the variation δxR are considered independent. For this reason the variational principle (105) can be referred to as constraint-free.

  • The variation operator δ in the path-integral variational principle defined above (106) is synchronous, that is, it is such that it leaves invariant both the proper time s and the field geodetics C(ro,r1) so that identically
    δ(ds)0δrs=0, (110)
    and furthermore, by construction, it leaves invariant also the corresponding boundary conditions, so that
    δxRso=δxRs1=0. (111)
  • Both the Hamiltonian and Lagrangian densities HRxRs,s and LRgs,dgsds,s are intrinsically non-unique, being necessarily determined up to an additive gauge function of the form
    ddsF(gs,r(s),s), (112)
    with F(gs,s) denoting a real arbitrary 4-scalar field of class C(2). In fact, the term
    ΔJ(xR(s))=sos1dsddsF(gs,r(s),s)F(gs,r(s),s)sos1 (113)
    has by construction null variation since
    δF(gs,r(s),s)sos10. (114)

    As a consequence, the path-integral Hamiltonian variational principle (105) is gauge invariant.

4.4. A Hypersurface-Integral Hamiltonian Variational Principle

The variational principle reported above, that is, Equation (105), applies to a single (albeit arbitrary) geodesic trajectory C(ro,r1). However, the extension to include the whole set of trajectories C(ro,r1) is actually obvious. In fact, the family of geodesic curves C(ro,r1) of Q4,g^(r) generates by assumption for all sso,s1 and all roΣ03Q14Q4 a flow, that is, a map

ro=r(so)r=r(s)χ(ro,so,s), (115)

which is in turn generated by a classical dynamical system (i.e., a bijection). Thus, the path integral functional (63) can be integrated on the set Σ03 (a 3D subset of Q14Q4, that is, the hypersurface on which s=so) yielding

S(xR)=sos1dsQ4dΩoδΣ3(ro)πμνsdgμνsdsHRxRs,s, (116)

which is now defined with respect to the functional class

xRsΣ3=xRxR(s)=x¯Rs+δxRs,xR(si)=xRi,xRiΣi3,i=0,1, (117)

where x¯Rsg¯μνs,π¯μνs denotes the general solution of the Hamiltonian Equation (107) (see equivalently Equation (120) below), which includes as a particular solution also g^sg^μνs. Furthermore, the tensor variation δxRs is assumed once again smooth and everywhere bounded for all sso,s1 so that again Equations (110) and (111) hold. Here furthermore

dΩo=d4rog^(ro) (118)

denotes the invariant volume element of the spacetime Q4,g^(r), and therefore evaluated with respect to the background metric tensor at the position ro. As a consequence, by construction manifestly δdΩo=0. Moreover, δΣ3(ro) is the characteristic function on the ensemble Σ3, that is, defined so that δΣ03(ro)=1 if roΣ03 and 0 otherwise. We stress that in the integrand the canonical state xR(s)gμνs,πμνs is considered a composite function of ro through r(s)=χ(ro,so,s). Thus, x¯Rs, g^s, gμνs (and similarly πμνs) are treated as functions of ro since g^μνsg^μνr(s)=χ(ro,so,s) and respectively gμν(s)gμνr(s)=χ(ro,so,s),s.

Once again the connection with the canonical Equation (87) can be formally established just as in the path-integral approach. This follows of course from the (volume-integral) synchronous Hamiltonian variational principle

δS(xR)ddαS(x¯R(s)+αδxR(s))α=0=0, (119)

the components of the tensor variations δxR(s) being arbitrary and independent also in this case. Furthermore, again the same considerations indicated above (see Section 4.3) apply here regarding the synchronous variations of x¯R(s) and g^s. The corresponding symbolic Euler-Lagrange equations given by the variational derivatives then coincide with the canonical Equation (87)

δS(xR(s))δπμν(s)=dgμνdsTRπμν=0,δS(xR(s))δgμν(s)=dπμνdsVgμν=0, (120)

which are subject (again) only to the initial conditions (88).

The variational principle (119) is obviously not independent from the one given above in term of the path-integral (100). So, the same conclusions discussed above actually apply. Nevertheless, additional peculiar aspects can be inferred. The first one concerns the synchronous character of the volume-integral variational principle. We stress in fact that the variation operator δ which appears in the variational principle (119) is again of the synchronous type. In fact, besides leaving invariant the proper time element ds and the proper time s, it leaves invariant by construction also the volume element dΩo. As a further remark, however, we also notice that the same operator δ leaves invariant by construction also the generic volume element centered at r=r(s), namely dΩ(s)=d4rg^(r(s)), because δg^(r)=δrμμg^(r) and, again by construction, μg^(r)0 since the Christoffel symbols in the covariant derivative are by assumption defined with respect to the same g^(r). Finally, in the functional (116) the integration is actually carried out on a 3-dimensional subset of spacetime, that is, a hypersurface Σ03. This feature seems interesting because it may be suggestive of its possible physical interpretation as a “source” of the geodetics.

4.5. A Volume-Integral Hamiltonian Variational Principle

An equivalent realization of the variational principle, alternative to those reported above, that is, (105) and (119), can be achieved upon parametrizing the canonical state as xRr,s=gμνr,s,πμνr,s. As a result, the Lagrangian coordinates and canonical momentum now become of the form

gμνr,s=g¯μνr,s+δgμνr,s,πμνr,s=π¯μνr,s+δπμνr,s. (121)

Thus, denoting by Q14 the boundary of Q14, the following functional setting is adopted

xRr,sQ14=xRxR(r,s)=x¯Rr+δxRr,s,δxRr,srQ14=0,δxRr,si=0,i=0,1, (122)

with δxRr,s being assumed again smooth and everywhere bounded on Q14 and the boundary conditions on Q14 being set so that

δgμνr,srQ14=0,δπμνr,srQ14=0. (123)

In this case the relevant variational functional can be identified with the action functional

Q(xR(r,s))=1sos1dssos1dsQ14dΩπμνr,sdgμνr,sdsHRxRr,s,s, (124)

with dΩ being the volume element evaluated with respect to the background spacetime, that is, dΩ=d4rg^(r), and the covariant derivative being prescribed according to Equation (89). Notice that in the functional setting (122) the tensor variation δxRr,s are required to be continuous so that in accordance with the boundary conditions (123), δxRr,s must tend smoothly to zero on the boundary Q14. This implies therefore by construction that δQ14dΩtα(r)αgμνr,sπμνr,s=0 so that the following identity

δsos1dsQ14dΩπμνr,sdgμνr,sds+gμνr,sdπμνr,sds=δQ14dΩπ¯μνr,sg¯μνr,ssos1=0 (125)

necessarily holds. Then, straightforward algebra displays the connection with the canonical Equation (87). This follows upon invoking that the synchronous variational principle

δQ(xR(r,s))ddαQx¯R(r,s)+αδxR(r,s)α=0=0 (126)

holds for arbitrary independent variations δxRr,s=δgμνr,s,δπμνr,s and the same considerations indicated above (see Section 4.3) apply to the synchronous variations of the general solution x¯R(s) and to the particular solution realized by g^s. Here we notice again that the variation operator is synchronous in the sense that, besides Equations (110) and (111), also the requirement δdΩ=0 holds. Then, the corresponding Euler-Lagrange equations, namely

δJQ(xR(r,s))δπμν(r,s)=0,δQ(xR(r,s))δgμν(r,s)=0, (127)

can be shown to coincide with Equation (87). We remark, finally, that in the case the canonical state is assumed not to depend explicitly on s, then the action functional (124) simply reduces to

Q(xR(r))=Q14dΩπμνrdgμνr,sdsHRxRr, (128)

where the Lagrangian derivative is now identified with the directional covariant derivative dds=tα(r)α, that is, the covariant derivative projected along the tangent to the local geodesic curve belonging to the family Cro,r1.

5. Physical Interpretation

The results presented above permit to clarify the conditions of validity and limits of previous approaches to the Lagrangian EH and Hamiltonian ADM variational principles. The main conclusions are that, with the possible exception of the Palatini approach, some of the previous approaches to the Lagrangian EH theory appear questionable. As far as Hamiltonian theory is concerned, we have shown that the attempt of identifying the ADM variables in terms of true canonical variables fails because of the intrinsic non-Hamiltonian character of the corresponding non-manifestly covariant theory. The reason of the failure is that the Ricci 4-scalar unavoidably carries second-order partial derivative of the same variables, a feature which destroys their Hamiltonian character.

The way-out solution for the Lagrangian approaches to GR is represented respectively by the unconstrained Lagrangian variational principle (44) and the unconstrained Lagrangian variational formulation for the ADM representation (74). Instead, the way-out solution for the Hamiltonian approach is given respectively by the three synchronous Hamiltonian variational principles represented by the path-integral Hamiltonian variational principle (105), the hypersurface-integral Hamiltonian variational principle (119) and the volume-integral Hamiltonian variational principle (126).

We stress that these variational formulations are based on the adoption of constraint-free variational principles together with a suitable prescription of the functional setting. For this purpose the Ricci tensor is considered prescribed in terms of a suitable background metric tensor g^(r), to be identified with a particular solution of EFE, which raises and lowers all tensor indices and determines the geometry of spacetime. This means, more precisely, that in all variational principles referred to above, the contribution gμνR^μν enters the Lagrangian density. The same term, in fact, gives rise to a well-defined contribution to the effective potential energy (see Equation (91) above) which occurs both in the Lagrangian and Hamiltonian densities.

Let us now show how this contribution can be given a well-defined physical interpretations. We first notice that, according to Equations (39) and (40), in the same variational principles R^μν is considered prescribed and independent of the variational field g(r), while gμν is variational. As a consequence, R^μν enters as a linear contribution to the Lagrangian and Hamiltonian densities defined above. Furthermore, in gμνR^μν both terms gμν and R^μν have a tensor character so that the same can be equivalently represented as

gμνR^μνgμνR^μν. (129)

This means that gμνR^μν retains a tensorial character since it behaves as a 4-scalar, a feature that confirms its physically objective character. As a consequence its possible physical interpretation follows as a gravitational coupling term.

That such an interpretation is appropriate is also suggested by analogy with the variational principle of Maxwell equations where the so-called EM coupling term AμJμ occurs. Its properties are similar. In fact, in the case of the EM variational principle Jμ and Aμ are considered respectively prescribed and variational tensor fields, thus warranting the 4-scalar character of the EM coupling term. Moreover, the fact that Jμ is considered constrained means that it is necessarily independent of Aμ, so that the coupling term provides a linear relationship in which only the linear term Aμ is involved in the variations of the field, while Jμ enters linearly the extremal Maxwell equations.

Finally, the fact that the Ricci scalar enters the Lagrangian density function as a coupling term has another interesting physical implication. Since it carries only a linear contribution of the metric tensor, namely the field coordinate, and not of the corresponding generalized velocities dgμνds, it follows that it can be interpreted as a potential term in the Lagrangian (and Hamiltonian) densities. In other words, it contributes only to the effective potential V(g,g^,r(s),s) (see Equation (91) and not to the corresponding effective kinetic energy TR(π,g^) (90).

The conclusion which one infers is therefore the following one: it appears conceptually incorrect to break the manifest covariance character of the Ricci scalar and the Ricci tensor and proceed—as done in ADM theory—by attempting to construct a Hamiltonian theory of GR in terms of non-tensorial terms which appear in the definition of the Ricci tensor. The same non-tensorial terms, besides breaking the manifest-covariance property, retain in fact quadratic partial-derivative terms and second-order derivatives which give rise to the intrinsically non-Hamiltonian character of ADM theory.

Instead, one envisages a manifestly covariant Hamiltonian theory of GR in which the Ricci tensor is set in its extremal form, acting effectively as a classical “potential” via a gravitational coupling term for an extended Lagrangian function. It is therefore not surprising that the abstract Hamiltonian structure of GR pointed out here, and represented by the set xR,HR, exhibits the standard canonical structure of classical mechanics and classical field theory.

6. Conclusions

In this paper, aspects of the variational theory of GR have been investigated. As far as the Lagrangian theory of GR is concerned, based on the Einstein-Hilbert action functional (as discussed in Section 2), the existence of a novel unconstrained Lagrangian variational principle has been pointed out (see Section 2.3 and Section 2.4). Such a variational principle represented in ADM variables (see Section 3.2) has also implications for the physical interpretation and understanding of ADM theory.

The main emphasis has been devoted to the two alternative Hamiltonian representations of GR, here analyzed and compared in detail: namely the ADM and the manifestly-covariant Hamiltonian approaches respectively. The outcome in our view is of potential value for unveiling and weighting the possible physical relevance of the two Hamiltonian representations, also in view of the corresponding quantum formulations. For this purpose the main features, analogies and differences arising between them have been pointed out.

Thus, the ADM theory has been shown to exhibit a number of major deficiencies in particular:

  • The structure of the ADM dynamical equations is not truly Hamiltonian, because the same equations still depend on first and second-order spatial partial derivatives of the relevant non-tensor variational fields. The same dynamical equations may not generally satisfy the variational functional characteristic constraint condition of EH theory pointed out here. We refer in particular to the requirement that the Ricci tensor components are functions of the background metric tensor g^(r). We have shown in fact that the same Hamilton-like structure is not preserved when such a constraint condition is taken into account.

  • ADM theory is not gauge-invariant, a property which is closely related to the nature of the asynchronous variational principle required for its construction.

  • Despite being covariant with respect to an arbitrary local point transformation, ADM theory is not manifestly-covariant.

On the contrary, in radical departure from such a complex and even possibly fundamentally incorrect picture, the manifestly-covariant Hamiltonian theory pointed out here presents itself as a straightforward alternative. With a number of notable features, in particular:

  • The relevant dynamical equations for the tensor fields are realized by means of evolution-type ODEs which identify a Hamiltonian system in a proper sense. Such a feature is warranted by the fulfillment of the constraint condition characteristic of the Einstein field equations, namely the requirement that the components of the Ricci tensor depend on the background metric tensor g^(r) only.

  • The corresponding Hamiltonian system xR,HR is constraint-free because, in contrast to the ADM approach, the tensor components of the Hamiltonian state xR(s)(g(s),π(s)) are independent.

  • The Einstein field equations are determined in terms of a particular solution of the initial-value problem associated with suitable Hamilton equations for the relevant 4-tensor fields. The solution corresponding to such an initial condition, requiring the initial canonical momentum to vanish, is unique. The same Hamiltonian structure, unlike in the ADM approach, is preserved also in validity of the initial condition which actually permits to recover the Einstein field equations.

  • The Hamilton equations are determined via a synchronous variational principle which leaves invariant both the proper-time which parametrizes the same equations and the 4-scalar volume element.

  • The theory is gauge invariant. As a consequence both the Hamiltonian and Lagrangian densities are intrinsically non-unique, being determined up to an additive gauge function.

  • The Hamiltonian system xR,HR is manifestly covariant, that is, it is set in 4-tensor form, since both xR and HR have a 4-tensor and the frame-independent character with respect to the background spacetime Q4,g^(r).

To conclude, let us summarize the physical interpretation of the manifestly-covariant Hamiltonian structure of GR determined above and how it compares to the ADM theory. The first striking difference is that the manifestly-covariant approach truly realizes a Hamiltonian structure. In fact in such a context the non-local dependencies characteristic of ADM theory, which make it intrinsically non-Hamiltonian, are not extant any more. The ADM dynamical structure appears as a property of the representation of the Ricci tensor, where h·ab is identified with the generalized velocity, but this does not define a true symplectic structure of GR. Curiously, the claimed ADM Hamiltonian structure is generated by the Ricci tensor Rμν that appears coupled to the variational tensor field gμν in the EH Lagrangian, even though it is proved that the variations of Rμν should not actually contribute to EFE. The reason why the ADM velocity h·ab appears is because Rμν is coupled to gμν in the Lagrangian. Therefore, the ADM dynamical theory arises indirectly through the variations of gμν. In contrast, in the new theory the Ricci scalar is interpreted as a coupling term in the Lagrangian density, which couples the metric tensor gμν (i.e., the physical field or generalized coordinate) with the Ricci tensor Rμν which determines the geometry of spacetime and is considered extremal, namely identified with R^μν, that is, expressed in terms of the background metric tensor g^(r). The basic implications is therefore that the set xR,HR can be naturally identified as possible candidate for the classical Hamiltonian structure of GR.

The same features indicated above, on the other hand, should be considered as an obvious requirement for the establishment of canonical quantization theory of gravity and, in particular, the formulation of a manifestly covariant theory of quantum gravity. And indeed the implementation of such a task, referred to as CQG-theory, has been recently successfully achieved [29,30,32,33,34]. In fact, CQG-theory represents a new, fertile and promising field of theoretical research in mathematical physics and quantum field theory.

Several questions, not treated in this paper, however, remain still open and are left to future investigations. One undoubtedly concerns the excruciating long issue whether, despite all difficulties and inconveniencies pointed out here, a physically meaningful quantum theory can still be based on ADM theory and more specifically on the Wheeler-DeWitt quantum wave equation.

Author Contributions

Conceptualization, M.T. and C.C.; methodology, M.T. and C.C.; writing—original draft preparation, M.T. and C.C.; writing—review and editing, M.T. and C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Hawking S.W. In: General Relativity. An Einstein Centenary Survey. Hawking S.W., Israel W., editors. Cambridge University Press; Cambridge, UK: 1979. [Google Scholar]
  • 2.Isham C. Canonical quantum gravity and the problem of time. In: Ibort L.A., Rodriguez M.A., editors. Integrable Systems, Quantum Groups, and Quantum Field Theory. Springer; Dordrecht, The Netherlands: 1993. [Google Scholar]
  • 3.Arnowitt R., Deser S., Misner C.W. Dynamical Structure and Definition of Energy in General Relativity. Phys. Rev. 1959;116:1322. doi: 10.1103/PhysRev.116.1322. [DOI] [Google Scholar]
  • 4.Arnowitt R., Deser S., Misner C.W. In: Gravitation: An Introduction to Current Research. Witten L., editor. Wiley; New York, NY, USA: 1962. [Google Scholar]
  • 5.Landau L.D., Lifschitz E.M. Field Theory, Theoretical Physics. Volume 2 Addison-Wesley; New York, NY, USA: 1957. [Google Scholar]
  • 6.Misner C.W., Thorne K.S., Wheeler J.A. Gravitation. 1st ed. W.H. Freeman; San Francisco, CA, USA: 1973. [Google Scholar]
  • 7.Wald R. General Relativity. University of Chicago Press; Chicago, IL, USA: 1984. [Google Scholar]
  • 8.Cremaschini C., Tessarotto M. Synchronous Lagrangian variational principles in General Relativity. Eur. Phys. J. Plus. 2015;130:123. doi: 10.1140/epjp/i2015-15123-4. [DOI] [Google Scholar]
  • 9.Cremaschini C., Tessarotto M. Manifest covariant Hamiltonian theory of General Relativity. Appl. Phys. Res. 2016;8:2. doi: 10.5539/apr.v8n2p60. [DOI] [Google Scholar]
  • 10.Cremaschini C., Tessarotto M. Hamiltonian approach to GR—Part 1: Covariant theory of classical gravity. Eur. Phys. J. C. 2017;77:329. doi: 10.1140/epjc/s10052-017-4854-1. [DOI] [Google Scholar]
  • 11.Einstein A. The Meaning of Relativity. Princeton University Press; Princeton, NJ, USA: 2004. [Google Scholar]
  • 12.Lang S. Differential and Riemannian Manifolds. Springer; Berlin/Heidelberg, Germany: 1995. [Google Scholar]
  • 13.Palatini A. Deduzione invariantiva delle equazioni gravitazionali dal principio di Hamilton. Rend. Circ. Mat. Palermo. 1919;43:203–212. doi: 10.1007/BF03014670. [DOI] [Google Scholar]
  • 14.Tessarotto M., Cremaschini C. Theory of Nonlocal Point Transformations in General Relativity. Adv. Math. Phys. 2016;2016:9619326. [Google Scholar]
  • 15.Schroedinger E. Space-Time Structure. Cambridge University Press; Cambridge, UK: 1950. [Google Scholar]
  • 16.York J.W. Boundary terms in the action principles of general relativity. Found. Phys. 1986;16:249–257. doi: 10.1007/BF01889475. [DOI] [Google Scholar]
  • 17.Chakraborty S. Boundary terms of the Einstein-Hilbert action. Fundam. Theor. Phys. 2017;187:43–59. [Google Scholar]
  • 18.Oliveri R., Speziale S. Boundary effects in General Relativity with tetrad variables. Gen. Relativ. Gravit. 2020;52:83. doi: 10.1007/s10714-020-02733-8. [DOI] [Google Scholar]
  • 19.Dirac P.A.M. Generalized Hamiltonian dynamics. Proc. R. Soc. A. 1958;246:326–332. doi: 10.4153/CJM-1950-012-1. [DOI] [Google Scholar]
  • 20.Dirac P.A.M. The theory of gravitation in Hamiltonian form. Proc. R. Soc. A. 1958;246:333–343. [Google Scholar]
  • 21.Sundermeyer K. Constrained Dynamics. Springer; Berlin/Heidelberg, Germany: 1982. Lecture Notes in Physics. [Google Scholar]
  • 22.De Donder T. Théorie Invariantive Du Calcul des Variations. Gaultier-Villars & Cia.; Paris, France: 1930. [Google Scholar]
  • 23.Weyl H. Geodesic Fields in the Calculus of Variation for Multiple Integrals. Ann. Math. 1935;36:607–629. doi: 10.2307/1968645. [DOI] [Google Scholar]
  • 24.Saunders D.J. The Geometry of Jet Bundles. Cambridge University Press; Cambridge, UK: 1989. [Google Scholar]
  • 25.Sardanashvily G. Generalized Hamiltonian Formalism for Field Theory. World Scientific Publishing; Singapore: 1995. [Google Scholar]
  • 26.Echeverría-Enríquez A., Muñoz-Lecanda M.C., Román-Roy N. Geometry of Lagrangian first-order classical field theories. Fortschr. Phys. 1996;44:235–280. doi: 10.1002/prop.2190440304. [DOI] [Google Scholar]
  • 27.Echeverría-Enríquez A., Muñoz-Lecanda M.C., Román-Roy N. Geometry of multisymplectic Hamiltonian first-order field theories. J. Math. Phys. 2000;41:7402–7444. doi: 10.1063/1.1308075. [DOI] [Google Scholar]
  • 28.Struckmeier J., Redelbach A. Covariant Hamiltonian Field Theory. Int. J. Modern Phys. E. 2008;17:435–491. doi: 10.1142/S0218301308009458. [DOI] [Google Scholar]
  • 29.Cremaschini C., Tessarotto M. Hamiltonian approach to GR—Part 2: Covariant theory of quantum gravity. Eur. Phys. J. C. 2017;77:330. doi: 10.1140/epjc/s10052-017-4855-0. [DOI] [Google Scholar]
  • 30.Cremaschini C., Tessarotto M. Space-time second-quantization effects and the quantum origin of cosmological constant in covariant quantum gravity. Symmetry. 2018;10:287. doi: 10.3390/sym10070287. [DOI] [Google Scholar]
  • 31.Cremaschini C., Tessarotto M. Classical variational theory of the cosmological constant and its consistency with quantum prescription. Symmetry. 2020;12:633. doi: 10.3390/sym12040633. [DOI] [Google Scholar]
  • 32.Cremaschini C., Tessarotto M. Quantum-wave equation and Heisenberg inequalities of covariant quantum gravity. Entropy. 2017;19:339. doi: 10.3390/e19070339. [DOI] [Google Scholar]
  • 33.Tessarotto M., Cremaschini C. Generalized Lagrangian path approach to manifestly-covariant quantum gravity theory. Entropy. 2018;20:205. doi: 10.3390/e20030205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Tessarotto M., Cremaschini C. The Heisenberg Indeterminacy Principle in the Context of Covariant Quantum Gravity. Entropy. 2020;22:1209. doi: 10.3390/e22111209. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES