Abstract
This paper studies the dynamics and conservation of energy. It evaluates the validity of Noether's theorem as a formal argument supporting the law of conservation of energy in physical systems. The analysis examines the role of nonconvexity in energy dynamics. The paper argues that nonconvexity can arise in the presence of catalytic effects or in situations of transitions between multiple regimes. With the introduction of nonconvexity, the analysis relies on a generalized Lagrangian and generalized Hamiltonian. The investigation applies under general conditions, allowing for multiple types of energy with dynamics driven by multiple state variables. Our key result is to show that the conservation of energy (Noether's theorem) holds under convexity but not under nonconvexity. This identifies situations where energy in isolated systems is not necessarily constant over time. By relaxing the law of conservation of energy, our analysis provides new insights into energy dynamics. It offers new directions for scientific inquiries, including improved understanding about the origin of life, the evolution of the early universe and the nature of space and time.
Keywords: Energy conservation, Noether's theorem, Nonconvexity, Hamiltonian
Significance: This paper studies the conservation of energy in the dynamics of physical systems, with a focus on the validity of Noether's theorem. The paper examines the role nonconvexity relying on a generalized Lagrangian and generalized Hamiltonian. The investigation applies under wide-ranging conditions, allowing for multiple types of energy with dynamics driven by multiple state variables. It shows that the law of conservation of energy (Noether's theorem) no longer holds under nonconvexity. Nonconvexity can arise in the presence of catalytic effects or in situations of transitions between multiple regimes. This identifies situations where energy in isolated systems is not necessarily constant over time. By relaxing the law of conservation of energy, our analysis provides new insights into energy dynamics. It offers new directions for scientific inquiries, including improved understanding about the origin of life, the evolution of the early universe and the nature of space and time.
1. Introduction
The conservation of energy has been seen as a fundamental law of nature. It states that energy in an isolated system can neither be created nor destroyed: it can only be transformed from one form to another. The law of energy conservation applies in general, going from Newtonian mechanics, to classical, relativistic and quantum mechanics. Arguments supporting the conservation of energy come from two sources. First, so far, there is no experiment showing strong empirical evidence against it. Second, the conservation of energy has been formally proven by Noether's theorem when the laws of physics do not change [1]. But there are many forms of energy, including mechanical, chemical, electrical, thermal and nuclear energy. This makes the assessment of energy transformations from one form to another somewhat difficult. The analysis of energy dynamics has typically focused on two types of energy: potential energy that is stored in objects in given position, and kinetic energy associated with motions. From Noether's theorem, a simple proof that energy is conserved can be obtained when the kinetic energy is a quadratic function of velocities and the potential energy does not depend on velocity ([2], p. 188). But could there be conditions under which energy conservation does not apply? Note that this question is not new: previous literature has addressed this issue by investigating how the arguments depend on how the conservation law is defined and the nature of energy dynamics [[3], [4], [5]]. This paper contributes to these inquiries with a focus on revisiting Noether's theorem under general conditions. Our analysis is general in the sense that it applies to multiple types of energy with dynamics involving multiple state variables. In this context, we show that Noether's theorem holds but only under convexity assumptions.
Our analysis of energy dynamics builds on three basic concepts that have been at the heart of physics and its historical development [[6], [7], [8], [9], [10]]:
C1: the time-invariance of the laws of physics,
C2: the principle of least action,
C3: the conservation of energy (Noether's theorem under C1).
The time invariance of the laws of physics (condition C1) applies to closed systems and assumes that dynamics of energy in an isolated system should be self-contained. The principle of least action (condition C2) assumes that the evolution of a physical system behaves in a way that is consistent with minimal effort [7,11,12]. Finally, the conservation of energy (C3) assumes that energy can neither be created nor destroyed; rather, it can only be transformed or transferred from one form to another [6,13]. Under conditions C1 and C2, our analysis examines the evolution of a closed physical system with a focus on the role of nonconvexity and its implications for condition C3.
In a minimization problem motivated under the least action principle (C2), it is typically assumed that the function being minimized is convex.1 Arguments supporting convexity are largely consistent with theoretical physics and its historical developments [2,[6], [7], [8], [9], [10]]. For example, kinetic energy is typically assumed to be a quadratic and convex function of velocities. Yet, there are at least two situations where convexity may not hold. The first situation arises in the presence of catalytic effects, where a small increase in a catalytic factor can generate a large positive effect on the objective function but only within a limited range, suggesting that catalytic factors would be associated with a non-convex objective function. The second situation involves dynamic systems displaying multiple regimes, where the convexity property holds within each regime but not in transitions between regimes. In this case, convexity would hold locally but not during transitions from one regime to another.
This paper examines the dynamics of energy when convexity may not apply. Under condition C1 and C2, the analysis is presented as a dynamic optimization problem and relies on optimal control methods. Under convexity, the approach makes use of Lagrangian and Hamiltonian as dual representations of the problem [14,15]. Theoretical physics has relied extensively on such dual formulations [2,[8], [9], [10]]. Our paper introduces nonconvexity in the analysis with an important difference: the dual formulation of the problem relies on a generalized Lagrangian and generalized Hamiltonian [16]. Under this generalized dual formulation, standard Lagrange multipliers no longer apply and are replaced by nonlinear functions capturing the effects of nonconvexity on dynamics. In this context, we explore how nonconvexity affects the conservation of energy (condition C3). When the laws of physics do not change over time, we show that the conservation of energy (as commonly stated in physics theory [1,6,13]) holds but only under convexity. Our analysis documents that the introduction of nonconvexity has significant effects on energy dynamics: in situations where nonconvexities arise, Noether's theorem does not hold and the law of conservation of energy no longer applies. Implications of our results are discussed.
2. Research methods
Consider an isolated physical system that evolves over time. As noted in the introduction, the analysis is applied to a closed system under condition C1 (time-invariance of the laws of physics) and condition C2 (principle of least action). In this context, the system dynamics is represented by a constrained minimization problem, with a focus on the evolution of energy over time. In turn, the optimal dynamics have a dual representation typically captured using a Lagrangian-Hamiltonian formulation. Such dual formulations are commonly found in classical mechanics [8,9,17], in special and general relativity [10] as well as in quantum mechanics [18].
At time , consider a physical system involving state variables and control variables , where , and denoting initial time and terminal time, respectively.2 For a given initial condition , assume that evolves over time according to the differential equation:
| (1) |
where and , . Under conditions C1 and C2, consider the minimization problem:
| (2) |
where and , and . The function measures energy in the system at time and measures total energy over the period . The functions and are assumed to be independent of time. This is consistent with condition C1: the time-invariance of the laws of physics. Also, the minimization problem in (2) is consistent with the principle of least action C2 [7,11,12].
Throughout the paper, we assume that the functions and are differentiable. Let } be the feasible set in (2). We also assume that has a nonempty interior and that the set : has a lower bound, thus guaranteeing that problem (2) has a solution. Denote its solution by , where is a piecewise continuous optimal control path and is the corresponding optimal state trajectory.
Note the generality of the approach. Given a system represented by the -dimensional state vector , the function in (2) provides a general measurement of the energy-momenta of the system at time and the function in equation (1) provides a general representation of energy dynamics. This allows for multiple types of energy with dynamics involving multiple state variables. As noted in the introduction, we do not assume that the function is convex, nor that the feasible set is a convex set. As discussed below, allowing for nonconvexity has important implications in our analysis.
To obtain insights into energy dynamics, we seek to investigate a dual formulation to (2). Let where is the class of absolutely continuous functions satisfying , and with being continuous over time. Consider the associated generalized Lagrangian:
| (3) |
Equation (3) a generalized Lagrangian; it would reduce to a standard Lagrangian in the special case where is taken to be linear for all , i.e. where , being standard Lagrange multipliers [15].
Equation (3) can be alternatively written as
| (4a) |
where is defined as
| (4b) |
Equation (4b) a generalized Hamiltonian; it would reduce to a standard Hamiltonian in the special case where is taken to be linear for all , i.e. where , [15].
Consider a saddle-point of in (3), (4a), (4b) where satisfies
| (5) |
for all . Following Gould [19], an important linkage between problems (2) and (5) is stated next. (All proofs are presented in Appendix A).
Lemma 1
Consider a saddle-point of in (5). Then is a solution to problem (2) with .
Lemma 1 shows that a saddle-point of the generalized Lagrangian in (5) provides a dual representation to the primal problem (2). It states two important properties. First, it shows that finding a saddle-point of the generalized Lagrangian gives a sufficient condition to find a solution to problem (2). Second, the dual formulation introduces the functions . As discussed below, these dual functions provide additional information about the nature of energy dynamics in (2).
While the saddle-point problem in (5) is a sufficient condition to find a solution to problem (2), in general it is not a necessary condition. Indeed, there are scenarios where a saddle-point problem in (5) does not exist, in which case the strong duality result would not hold and duality would break down. It means that some regularity conditions need to be imposed on problem (2) to guarantee strong duality. These conditions have been explored in previous literature [20,21]. Below, we assume that these conditions are satisfied.
The next result illustrates the usefulness of the dual formulation in (5).
Lemma 2
(Generalized minimum principle): If is a saddle-point of in (5), then for all , it satisfies3
(6a)
(6b)
(6c) Lemma 2 is a “generalized minimum principle”. Indeed, equation (6a), (6b), (6c) generalize some well-known results found in previous literature. Two special cases of (6a), (6b), (6c) are of particular interest. First, consider the case where equation (1) takes the form: where . Then using (4b), an interior solution to the minimization problem in (6c) implies
Substituting this result into equation (6b) and using (4b) yields the following result.
Corollary 1: Assume that and in (1). If is a saddle-point of , then for all , equations (6a), (6b), (6c) reduce to
| (7) |
Equation (7) is the standard Euler-Lagrange equation of optimal control [15]. The Euler-Lagrange equation (7) has been a cornerstone in physics [6,7], including classical mechanics [2,8,9,17] and special and general relativity [10]. Corollary 1 states a nice result: the Euler-Lagrange equation is a special case of Lemma 2. But this result is obtained assuming that , which is a rather restrictive condition for state dynamics. This indicates that equation (7) would not apply in situations where . This identifies an important limitation of the Euler-Lagrange equation: equation (7) is not expected to hold in systems exhibiting more complex dynamics.
Second, consider the special case where where is restricted to include only linear functions. In this case, means that can be written as . Then, the generalized Lagrangian in (3) becomes the standard Lagrangian , and the generalized Hamiltonian in (4b) becomes the standard Hamiltonian where are Lagrange multipliers. In this context Lemma 2 gives the following result.
Corollary 2:
Assume that with and . Then, for , equation (6a), (6b), (6c) become4:
(8a)
(8b)
(8c) Equation (8a), (8b), (8c) are Pontryagin’s principle of optimal control [14,15,22]. This is another nice result: the Pontryagin principle of optimal control is a special case of Lemma 1-2. Comparing (6) and (8) shows that in (6b) plays the role of the Lagrange multipliers in (8b). The interpretation of is discussed below.
Assuming that a saddle-point exists in (5) and using Lemma 1, equation (6a), (6b), (6c) in Lemma 2 are necessary conditions for to be a solution to problem (2). Similar arguments apply to equation (7) in Corollary 1, or equation (8a), (8b), (8c) in Corollary 2. But in general, these equations are not sufficient. This raises the question: Under what conditions would these equations be both necessary and sufficient conditions to characterize a solution to problem (2)? This question has been addressed in the context of equation (8a), (8b), (8c) applied to a standard Lagrangian approach [15,23]. Mangasarian [23] and Luenberger [15] showed that equation (8a), (8b), (8c) can become sufficient conditions for to be a solution to problem (2) under convexity, where convexity is defined as situations where is a convex function and the feasible set is convex. Indeed, under convexity, the separating hyperplane theorem applies establishing the existence of a separating hyperplane with slopes given by the Lagrange multipliers [15, p. 219]. This argument indicates that, without convexity, equation (8a), (8b), (8c) would in general not be sufficient for to be a solution to problem (2). As noted above, we do not assume that is a convex function nor that the feasible set is convex, i.e. we allow for nonconvexity. In this context, equation (8a), (8b), (8c) become less useful: they may no longer characterize the solution to problem (2). This is a situation where equation (6a), (6b), (6c) (developed under a generalized Lagrangian in (3)) have an important advantage compared to (8) (developed under a standard Lagrangian): from Lemma 1, the saddle-point problem (5) provide sufficient conditions for to be a solution to problem (2) even under nonconvexity.
These arguments are illustrated in Fig. 1, Fig. 2. Fig. 1 presents a situation under convexity (where is a convex function and the feasible set is convex). In Fig. 1, the optimal solution is at point O. At this point, there is a separating hyperplane EOF that goes through point O while being tangent (at point O) to both the feasible set (with AOB as a lower bound) and the isovalue line5 COD. In this case, convexity supports the existence of a separating hyperplane and of associated Lagrange multipliers; and convexity means that equation (8a), (8b), (8c) (developed under a standard Lagrangian approach) provide necessary and sufficient conditions to identify the optimal point O.
Fig. 2a-b present situations under nonconvexity where neither is a convex function nor is convex set. They identify two points O and O’. In Fig. 2a (evaluated at time ), the global solution to problem (2) is at point O and not at point O’. Point O’ is suboptimal (it provides only a “local solution” valid in the neighborhood of O’). Fig. 2a shows the presence of a hyperplane EOF that goes through point O while being tangent to both the feasible set (with AOB as a lower bound) and the isovalue line COD. But under nonconvexity, the hyperplane EOF does not have the separation property: the hyperplane intersects the isovalue line COD when is small; and it enters the feasible set when is large. This failure to have a separating hyperplane is due to non-convexity. As discussed below, this failure is important: it means that equation (8a), (8b), (8c) would be satisfied at both O and O’, reflecting that equation (8a), (8b), (8c) can fail to identify a solution to problem (2) (as point O’ is suboptimal). This provides the motivation to switch from a standard Lagrangian to the generalized Lagrangian in (3): switching from a linear to a nonlinear is equivalent to switching from a (non-separating) hyperplane EOF to a separating nonlinear hypersurface E’OF’ in Fig. 2a. Similar arguments apply to Fig. 2b (evaluated at time ).
In Fig. 2a-b, the hyperplane EOF and the separating hypersurface are both tangent to the lines AOB and COD, illustrating why equations (6a), (6b), (6c), (8a), (8b), (8c) would hold in the neighborhood of point O. Yet, allowing for nonlinear in the generalized Lagrangian means that the slopes of are no longer constant. From Fig. 2a–b, a separating hyperplane may no longer exist under nonconvexity; but the separation property still applies to a nonlinear hypersurface. Under nonconvexity, the functions must be allowed to be nonlinear to guarantee the separation property. This shows why the standard Lagrangian approach is overly restrictive: without convexity, it is inappropriate to assume that the functions are linear (i.e., that the Lagrange multipliers are constant). This motivates switching to the generalized Lagrangian in (3) where the function are allowed to be nonlinear. In this case, the slopes of in (3) are not constant, i.e., its marginal values can vary with the evaluation point .
To stress the importance of switching to the generalized Lagrangian in (3), note that Lemma 1 guarantees to find a global solution (denoted by point O in Fig. 2a-b). Thus, Lemma 1 avoids the issue related to the existence of suboptimal points (such as O’ in Fig. 2a-b) where equation (7) or (8) would be satisfied. In addition, equation (6a), (6b), (6c) provide useful information about optimal dynamics under nonconvexity. This includes the dynamics of given in equation (6b) which involves the term . When is nonlinear (under nonconvexity), its slopes would change with the evaluation point . In this context, equation (6b) shows that the evolving slopes of play an important role in energy dynamics.
To provide insights into the interpretation of , consider the perturbation function:
(9) where and denotes an exogenous change in . From (2), noting that , it follows that reflects the effects of a change in energy flows on the optimal total energy . When the function is differentiable in at point , Chavas [16] showed that
(10) where is the derivative of with respect to at time . Equation (10) is a version of the envelope theorem, stating that is the marginal amount of energy generated by a small change in the states at time . When is linear, then and (10) reduces to the standard interpretation of Lagrange multipliers, with measuring the marginal values of the constraints [15, p. 222]. Equation (10) generalizes this interpretation to situations where is nonlinear. With measuring the marginal rate of transformation of the states into energy and noting that where is a smooth path of from to , it follows that can be interpreted as a measure of the implicit energy generated by the states at time . Define
(11) Using (10), the term in (11) is the total implicit energy generated by the states during the period . Using equation (4a) evaluated at the optimum, we have
(12) The term is the total generalized energy defined as actual total energy of the system plus the total implicit energy generated by the states over the period . It follows from (12) that the term can also be interpreted as the total generalized energy in the system. This gives the following useful interpretation of the Hamiltonian: at the optimum, the Hamiltonian provides a measure of the generalized energy associated with the system at time . Thus, our analysis identifies the presence of three types of energy at time : actual energy , implicit energy , and generalized energy measured by the Hamiltonian . This raises the question: what does the evolution of generalized energy tell us about energy conservation? This is the topic of the next section.
Fig. 1.
Optimality under convexity
Note: The isovalue line is the set of points that gives the same value of the objective function as the one obtained at the optimum in the minimization problem (2).
Fig. 2.
Optimality under nonconvexity, 2a: Evaluated at time
2b: Evaluated at time .
3. Results and discussion
As just discussed, we can study the conservation of energy by evaluating whether generalized energy (as measured by the Hamiltonian ) is constant over time. With a focus on the role of nonconvexity, our main result is stated next.
Proposition 1: Assume that and are independent of and that . Evaluated at the optimum ,
-
a)
If for all , then is constant over time with
| (13) |
-
b)
If is a nonlinear function, , then can vary over time as it satisfies
| (14) |
Under condition C1, equation (13) is a well-known result: at the optimum in (2), the Hamiltonian is constant over time [24]. This is Noether's theorem, stating that the conservation of energy holds in an isolated system if the laws of physics do not change over time [1,13]. This result is obtained under a standard Lagrangian which applies under convexity (when the separating hyperplane theorem holds and is taken to be linear). But equation (14) gives a different result: it states that that the Hamiltonian is not necessarily constant under a generalized Lagrangian approach where the function is allowed to be nonlinear. Equation (14) shows that would over time when , i.e. when the total shift in its shift due to a slope change. The need for to be nonlinear arises under nonconvexity (as illustrated in Fig. 2). This is an important result: even when the laws of physics do not change, the conservation of energy no longer holds under nonconvexity. Note that, applied to generalized energy, these findings are obtained under wide-ranging conditions, allowing for multiple types of energy with dynamics driven by multiple state variables.
The results stated in Proposition 1 hold under conditions C1 (the time-invariance of the laws of physics) and condition C2 (the principle of least action). In this context, we obtain two important results: 1) while the standard Lagrangian approach is appropriate under convexity, the analysis needs to be extended to a generalized Lagrangian under nonconvexity; 2) under condition C1, while the conservation of energy applies under convexity, it does not necessarily hold under nonconvexity.
These results indicate that nonconvexity can play an important role in the dynamics of energy. Our analysis has several important implications. If the departure from convexity occurs only in specific neighborhoods, then it is possible that different dynamic patterns would develop in different neighborhoods. Importantly, this argument holds while keeping the laws of physics constant across time and space. A good example is given by apparent differences between quantum physics (holding in the large) versus relativistic physics (holding in the small) along with unsuccessful attempts to integrate them into a unified theory [25]. Addressing these issues raises several challenges. First, our analysis stresses the limitations of local dynamics: a good understanding of local dynamics can fail to provide an accurate representation of global dynamics. Second, we need to understand better how nonconvexity can generate apparent differences in dynamic patterns. Third, how could we use this information to generate hypotheses on how dynamics can vary across neighborhoods? Fourth, how could we obtain measurements that would help distinguish between local dynamics and global dynamics?
Nonconvexity also has implications for dynamics over time. This is illustrated in Fig. 2a-b. Fig. 2a and b shows the optimum (point O) at time and respectively. The optimum in each Figure occurs in a different region of the state space, each region exhibiting its own local dynamics, with the two regions being separated by zones exhibiting nonconvexity. When , it means that between and , the system dynamics must go through a transition phase. One possibility is that the system would stay in one region for some extended period (before time ), switch to another region between time and , and then stay there after time ’. In this case, due to nonconvexity, the dynamics would change significantly in the transition phase, even while the laws of physics remain unchanged. This is illustrated in Fig. 2a-b, showing that the dynamics are qualitatively different before, during and after the transition. This scenario seems to fit the story of cosmic inflation related to changing dynamics in the early universe, with an inflationary period believed to have occurred a fraction of a second after the Big Bang, period during which significant changes occurred in the relative role of gravity, electromagnetism, the strong and the weak nuclear forces [26]. These changing patterns are a reminder about the limitations of extrapolations beyond current observations. They also indicate a need to refine our understanding on how nonconvexity could contribute to shifting dynamics in the evolution of physical systems.
From Proposition 1, nonconvexity can play a significant role in the evolution of energy. Proposition 1a shows that, under convexity and assuming invariant laws of physics (condition C1), a physical system satisfying the least action principle (condition C2) would imply the conservation of energy (Noether's theorem). But Proposition 1b leaves open the possibility that energy could increase under some scenarios. Importantly, Proposition 1b applies only under a generalized Lagrangian-Hamiltonian (where the Lagrange multipliers are not constant). Since the need for a generalized Lagrangian-Hamiltonian arises only under nonconvexity, it follows that the prospects to see increases in energy (as measured by the generalized Hamiltonian) must be associated with nonconvexity. As noted in the introduction, this can happen in the presence of catalytic factors. First, by stimulating local energy transfers toward specific states, catalytic factors can generate local nonconvexity. Second, such transfers can possibly contribute to increased energy (at least in some neighborhoods). This scenario seems particularly relevant in evaluating the prospects for the origin of life (as life would be unlikely to develop in situations where energy is scarce). As stated in equation (14), the generalized Hamiltonian can change over time when the marginal values of the states are nonlinear; and it would increase over time when , i.e. when the total shift in exceeds its shift due to a slope change. This can help identify situations where energy could increase over time. Such insights could prove useful in the search for life on exoplanets.
Finally, our analysis is based on the dynamic minimization problem in (2), where the objective function measures energy flows in a closed physical system. This is the primal problem where the state variables reflect the evolution of energy-momenta in the system. The associated dual problem is given in equations (3), (4a), (4b), (5), involving the functions where measures the marginal rate of transformation of the states into energy at the optimum. As discussed in Cortes and Smolin [27], problem (2) may involve no space-time variables, no locality, no Planck length, and no uncertainty, while the dual variables could represent measures of space-time. In this case, energy-momenta would be treated as fundamental variables in the primal problem, while space-time would emerge from its dual formulation. As we argued, there is a need to consider to be nonlinear functions under nonconvexity. If the dual variables are emerging space-time variables, then space-time would exhibit nonlinear dynamics. Examining the linkages between nonconvexity and the dynamics of space-time is a good topic for future research.
4. Conclusion
Our analysis has shown that, while Noether's theorem holds under convexity, it does not hold under nonconvexity. As a result, the conservation of energy does not apply in general. We have argued that non-convex situations can arise in the presence of catalytic effects or in transitions between alternative regimes. By relaxing the law of conservation of energy, our analysis provides new insights into energy dynamics. It offers new directions for scientific inquiries, including improved understanding about the origin of life, the evolution of the early universe and the nature of space and time.
Data availability statement
Data availability or data sharing is not applicable to this article as no data set was created or analyzed in this study.
CRediT authorship contribution statement
Jean-Paul Chavas: Writing – review & editing, Writing – original draft, Project administration, Methodology, Investigation, Formal analysis, Conceptualization.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Footnotes
Letting be a convex subset of , a differentiable function is convex if and only if it satisfies for all ; and for , being a convex function implies that the set is a convex set.
More generally, could be generalized coordinates defined in a -dimensional manifold .
In the case where the terminal time is free, using (A4) in Appendix A, the following equation would be added to (6a)-(6c): .
From footnote 3, if the terminal time is free, the following equation would be added to (8a)-(8c):
The isovalue line is the set of points that gives the same value of the objective function than the one obtained at the optimum in the minimization problem (2).
Appendix A. Proofs
Proof of Lemma 1: The first inequality in (5) implies that
| (A1) |
Assume that for some . It follows that there exists a and an interval , satisfying either or for all . Let . Consider and , where is an indicator function. Note that and . Let if for . This would imply that , i.e. that does not have an upper bound. But this contradicts (A1). Thus, we conclude that
| (A2) |
Using (A2), the second inequality in (5) states that
| (A3) |
which implies that is a solution to problem (2). Using equations (A2), (A3), equations (2), (3) also imply the strong duality result: .
Proof of Lemma 2: Equation (6a) is equation (A2). Consider a feasible path , where denotes small changes in . Using (4a), (4a) first-order Taylor series expansion in the neighborhood of , we have:
| (A4) |
using integration by parts. Using the second inequality in (5), we have for all feasible . From (A4), this implies
for all feasible , which yields (6b).
Finally, note that the second inequality in (5) implies that for all . Then, equation (6c) follows from (4a).
Proof of Proposition 1: When , equation (6c) implies that , and equation (8c) implies that .
-
Case a)
If is linear for all , we have and where are standard Lagrange multipliers, . Then, evaluated at and ,
using equations (8a), (8b), (8c)
,which is equation (13).
-
Case b)
If is nonlinear, , then and
using equations (6a)-(6c)
which is equation (14).
References
- 1.Noether E. “Invariante Variationsprobleme". Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen. Mathematisch-Physikalische Klasse. 1918:235–257. [Google Scholar]
- 2.Cline D. University of Rochester; Rochester, NY: 2018. Variations Principles in Classical Mechanics. [Google Scholar]
- 3.Maudlin T., Okori E., Sudarsky D. On the status of conservation laws in physics: implications for semiclassical gravity. Studies of History and Philosophy of Modern Physics. 2020;69:67–81. [Google Scholar]
- 4.Carroll S.M., Lodman J. “Energy non-conservation in quantum mechanics”. Found. Phys. 2021;51(4):83. [Google Scholar]
- 5.Aharonov Y., Popescu S., Rohrlich D. On Conservation Laws in Quantum Mechanics” PNAS. 2021;118(1) doi: 10.1073/pnas.1921529118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Feynman R. ume 1. Addison Wesley; Reading, MA: 1964. (The Feynman Lectures on Physics). [Google Scholar]
- 7.Feynman R. ume 2. Addison Wesley; Reading, MA: 1964. (The Feynman Lectures on Physics). [Google Scholar]
- 8.Goldstein H. Addison-Wesley Pub. Co.; Reading, MA: 1980. Classical Mechanics. [Google Scholar]
- 9.Brizard A.J. World Scientific Publishing Company; Singapore: 2014. Introduction to Lagrangian Mechanics. [Google Scholar]
- 10.Foster J., Nightingale J.D. Springer; New York: 2006. A Short Course in General Relativity. [Google Scholar]
- 11.Stehle P.M. In: Encyclopedia of Physics. S P., editor. Mc-Graw-Hill; New-York: 1993. “Least-action principle” in parker. [Google Scholar]
- 12.Ho V.B. On the principle of least action. Int. J. Phys. 2018;6(2):47–52. [Google Scholar]
- 13.Hanc J., Tuleja S., Hancova M. Symmetries and conservation laws: consequences of Noether's theorem. Am. J. Phys. 2004;72(4):428–435. [Google Scholar]
- 14.Pontryagin L.S., Boltianski V., Gamkrelidze R., Mitchchenko E. John Wiley and Sons; New York: 1962. The Mathematical Theory of Optimal Processes. [Google Scholar]
- 15.Luenberger D. John Wiley and Sons; New York: 1997. Optimization by Vector Space Methods. [Google Scholar]
- 16.Chavas J.P. “Optimal control under nonconvexity: a generalized Hamiltonian approach”. Optim. Control Appl. Methods. 2023;44:2679–2692. [Google Scholar]
- 17.Shepherd T.G. Symmetries, conservation laws, and Hamiltonian structure in geophysical dynamics. Adv. Geophys. 1990;32:287–337. [Google Scholar]
- 18.Boscain U., Signalotti M., Sugny D. Introduction to the Pontryagin maximum principle to quantum optimal control. PRX Quantum. 2021;2(3) [Google Scholar]
- 19.Gould F.J. Extensions of Lagrange multipliers in nonlinear programming. SIAM J. Appl. Math. 1969;17:1280–1297. [Google Scholar]
- 20.Slater M. 1950. “Lagrange Multipliers Revisited” Cowles Commission Discussion Paper No. 403. [Google Scholar]
- 21.Rubinov A.M., Huang X.X., Yang X.Q. The zero duality gap property and lower-semicontinuity of the perturbation function. Math. Oper. Res. 2002;27:775–791. [Google Scholar]
- 22.Chang D.E. vol. 47. 2011. pp. 630–633. (A Simple Proof of the Pontryagin Maximum Principle on Manifolds” Automatica). [Google Scholar]
- 23.Mangasarian O.L. Sufficient conditions for the optimal control of nonlinear systems. SIAM J. Control. 1966;4:139–152. [Google Scholar]
- 24.Little G., Pinch E.R. “The Pontryagin maximum principle: the constancy of the Hamiltonian”. Journal of Mathematical Control and Information. 1996;13(4):403–408. [Google Scholar]
- 25.Hansson J. “ the 10 biggest unsolved problems in physics”. Int. J. Mod. Phys. 2015;1(1):12–16. [Google Scholar]
- 26.Guth A.H. Basic Books; New York: 1997. The Inflationary Universe: the Quest for a New Theory of Cosmic Origins. [Google Scholar]
- 27.Cortês M., Smolin L. “The universe as a process of unique events”. Phys. Rev. D. 2014;90(8) [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data availability or data sharing is not applicable to this article as no data set was created or analyzed in this study.


