Abstract
Thermodynamic selection is an indirect competition between agents feeding on the same energy resource and obeying the laws of thermodynamics. We examine scenarios of this selection, where the agent is modeled as a heat-engine coupled to two thermal baths and extracting work from the high-temperature bath. The agents can apply different work-extracting, game-theoretical strategies, e.g. the maximum power or the maximum efficiency. They can also have a fixed structure or be adaptive. Depending on whether the resource (i.e. the high-temperature bath) is infinite or finite, the fitness of the agent relates to the work-power or the total extracted work. These two selection scenarios lead to increasing or decreasing efficiencies of the work-extraction, respectively. The scenarios are illustrated via plant competition for sunlight, and the competition between different ATP production pathways. We also show that certain general concepts of game-theory and ecology–the prisoner’s dilemma and the maximal power principle–emerge from the thermodynamics of competing agents. We emphasize the role of adaptation in developing efficient work-extraction mechanisms.
I. INTRODUCTION
Thermodynamics studies energy transfer, storage, and usage. It started as a theory of heat engines, drove the Industrial Revolution, and matured at nearly the same time when evolutionary biology emerged. Nowadays, thermodynamics is perhaps the most general phenomenological theory in all of science that applies to all types of systems at all levels of organization. Several attempts have been made to represent various aspects of evolutionary biology, in particular, evolution of populations and ecosystems, within the framework of thermodynamics [1-11].
Here we develop a thermodynamic approach to selection. Its main premise is that once organisms (agents) extract work (useful energy) and obey the laws of non-equilibrium thermodynamics in their metabolism [12], they can be modeled as heat engines. Agents interact (compete) indirectly, if the extraction goes from the same source. This competition can be represented via game theory, and hence its outcome depends on work-extraction strategies adopted by the agents. Such strategies depend on two parameters: efficiency and power. The second law of thermodynamics states that the efficiency of any heat engine—defined as the ratio of useful extracted energy (work) to the total energy input (heat)—is bound from above by Carnot’s efficiency [13-15]. But heat engines operating at the maximum efficiency yield zero work per unit of time (zero work power) resulting in the well-known power-efficiency tradeoff [16-19]: the most efficient regime is not powerful, whereas the most powerful regime is not efficient [20].
The energy budget of an organism can be described as three main energy currents: input, storage, and output (waste) [4, 21-23]. The relationship between these three currents are similar to that in a generalized heat engine: input heat, work (storage), and output heat. Similar to abiotic heat engines, organisms also face the power-efficiency (or speed-fidelity) trade-off. In particular, this trade-off is seen in molecular machines of cells [24-27], and also at the level of organism phenotypes [28-34]. The power-efficiency trade-off is subject to selection and depends on available energy resources.
Hence, our goal is to explore a physical model for the evolution of the metabolic power-efficiency trade-off, where agents are modeled as heat engines. We do not specify how the extracted work is utilized (reproduction, metabolism, defense, or other functions). Instead, we focus on different strategies (phenotypes) that are available to the agents to extract and store energy. The competition and selection emerge because at least two agents employ the same source (high-temperature bath). There are two general scenarios for such competition, for effectively infinite and for finite—and hence depletable—resources. The quantities relevant for evolution in these two situations are, respectively, the power of work extraction and the stored energy (=total extracted work).
Competition for an infinite resource is analogous to the competition of plants for light. Here the source, i.e. the Sun, acts as a thermal bath providing high temperature photons for the heat engine operation of the photosynthesis. It is not depletable, and yet, there is a competition for a limited energy current reaching the forest surface [63-73]. Plants can behave differently when facing such competition, from confrontation to avoidance of the competitor [71, 74, 75]. In section III we formalize and examine these situations that can have more general relevance in the context of nutrient allocation between cells in multicellular organisms. In particular, we show that the competition leads to increasing the efficiencies consistently with observations.
Exploitation of a finite source is a dynamical process, since this source is depleted due to the functioning of the agents themselves. We study this process in section IV and show that competition favors heat engines with lower efficiencies. An example of this is the fermentation (aerobic and anaerobic) and respiration pathways of ATP production in yeasts [31, 32, 34, 83, 84, 86] and in solid tumor cells [90-93]. Here the ATP production refers to work-extraction and storage [47]. Respiratory ATP production is far more efficient than fermentation, but the speed and hence the power of the fermentation path is greater [76, 83, 85]. Given the available resources and the presence of competition, cell choose one or the other pathway of ATP production [31, 83, 86].
Agents competing for a depletable resource alter the common environment similarly to what happens in niche construction theories [98-100]. Thereby they shape the selection process. Hence, we face a non-trivial game-theoretic situation, where the optimal values of power and efficiency under competition are not unique. However, the environmental changes caused by the behavior of competing agents are “myopic”, that is, the behavior of the agents is not based on perception of the global environmental state.
The common environment of competing agents changes due to the very engine functioning. This fact poses the problem of adaptive (i.e. structure adjusting) versus non-adaptive agents. This is analogous to the phenotype adaptation that is observed in organisms [28, 77-81, 94-97]. As seen below, adaptation plays an important role in selection process.
The rest of this paper is organized as follows. Next section defined the heat engine model we employ. Section III addresses competition for an infinite resources that amounts to sharing a fixed energy current. Section IV studies the competition for a finite resource. We explore this situation via studying two competing agents that can be adaptive or not. Sections III and IV can be read independently from each other. Both sections employ ideas and techniques from game theory, though no deep preliminary knowledge on this subject is assumed, since we provide the necessary background. We summarize in the last section. All technical derivations are relegated to Appendices.
II. THERMODYNAMIC AGENT (HEAT ENGINE)
A. Heat-engine model
To model energy extraction and storage, we focused on the minimal thermodynamically consistent model of a heat engine [18, 52]. For further clarity, we start with the explicit implementation of this model via three-energy level Markov systems attached to different heat baths at different temperatures; see Fig. 1 and Appendix A for details. Having an explicit model is essential for clarifying the nature of the involved parameters and the extracted work (stored energy). However, the model will be explored in the high-temperature (linear response) regime, where the implementation details are not essential, and where it is equivalent to linear thermodynamic models employed in biophysics [52].
FIG. 1:
A schematic representation of the heat-engine model (1-10), and related thermal baths that cause transitions between the three engine states with energies Ei, i = 1, 2, 3. Now ρi←j is the transition rate from state j to state i. The symbol ρi↔j in the figure means that both transitions ρi←j and ρj←i are allowed for. Jh, Jc, J are heat currents from the thermal baths with temperature Th, Tc, T, respectively. Note that the thermal baths drive transitions between the energy levels.
The engine has three states i = 1, 2, 3. This is the minimal number of states a stationary heat engine can have, because it should be in a non-equilibrium state (i.e. to support one cyclic motion), and has to support three external objects, one work-source and two thermal baths. Each state i has energy Ei, such that
| (1) |
Transitions between each pair of different states are caused by the different thermal baths having different temperatures (Th, Tc, T) that accordingly provide or accept necessary energies; cf. Fig. 1. We assume that these thermal baths are in thermal equilibrium states, which means that the transition rates that drive Markov evolution of the engines obey the detailed-balance condition, for example the transition rates between states {1, 3} satisfy the following relation
| (2) |
where βh = 1/Th. Similar relation holds for the transition rates ρ1←2 and ρ2←3 caused by thermal baths with temperature T and Tc, respectively.
One temperature is assumed to be infinite [37]: β = 1/T = 0. This bath is then a work-source. This key point can be explained as follows. First, note that an infinite temperature thermal bath exchanges dE energy without changing its own entropy, dS = βdE = 0, which is a feature of mechanical device (a sources of work) [37]. Second, if the T = ∞-bath spontaneously interacts with any (positive) temperature bath, then the former bath always looses energy. Hence, its energy is freely convertible to any form of heat, as expected from work. Next, we assume Th > Tc, as necessary for heat engine operation.
In the stationary (but generally non-equilibrium) state the average energy of the three-level system is constant
| (3) |
where pi is the probability of finding the system in energy level Ei. Here J and Jn with n = h, c are the average energy lost (for J, Jn > 0) or gain (J, Jn < 0) by each bath per unit of time. Eq. (3) is the first law of thermodynamics for a stationary state [44]. Now (3) indicates on a perfect coupling between the thermal baths and the three-level system: there is no an energy current standing for irreversible losses within the system; cf. [27, 32, 85, 87-89].
In the stationary state, the energy currents hold
| (4) |
| (5) |
where 𝓩 is the normalization factor defined by transition rates (see Appendix A).
If the system functions as a heat engine, i.e. on average, pumps energy to the work-source, then
| (6) |
Using Eq.(5,4) and the condition (6) one get the condition for the system to operate as a heat-engine
| (7) |
The efficiency of any heat engine is defined as the result (the extracted work) divided over the resource (the energy coming from the hot bath). Recalling E3 > E2, the efficiency η reads from (7):
| (8) |
Hence, the efficiency η is bounded from the above by the Carnot efficiency nC. Eq. (8) is the second law of thermodynamics for the heat engine efficiency [44].
Eqs. (4, 7, 8) demonstrate the power-efficiency trade-off: at the maximum efficiency η = ηC the power −J of the heat engine nullifies. This trade-off is also a general feature of heat engines [19]. A clear understanding of this trade-off is one pertinent reason for having an explicit microscopic model of a heat engine.
The work power −J depends on the specific form of the transition rates that enter the detailed balance condition (2) (see Appendix A); for example the Arrhenius form of transition rates applies in chemical reaction dynamics [43]. Here we work in the high-temperature limit, where the details of rates are not important provided they hold the detailed balance. Now Ei/Tc ≪ 1 and Ei/Th ≪ 1, but 0 ≤ ϑ ≤ 1 in (7) can be arbitrary. In this limit the heat engine power reads via Eq.(4, 8):
| (9) |
where is a constant. Eq. (9) shows that for a fixed ϑ the maximum power of ∣J∣ of the engine is attained for
| (10) |
B. Relevance of heat-engines in biology
Below we shall heuristically apply the heat engine model to ATP production, where the two thermal baths refer to the e.g. glucose (the major reactant of the ATP production), while the work corresponds to the energy stored in ATP, which is metastable at physiological conditions. In this context let us discuss to which extent heat-engine models can be applied to transformations of chemical energy. The standard understanding of the chemical energy stored in certain molecular degrees of freedom is that it is isothermal and is described by the (Gibbs) free energy difference between reactants and products. This coarse-grained description does not tell where (in which degrees of freedom) the energy was stored and how it came out. Detailed mechanisms of such processes are still unclear, e.g. there is a long-standing and on-going debate on how precisely ATP delivers the stored energy during its hydrolysis and how this energy is employed for doing work; see e.g. [45-47]. However, it is clear that at sufficiently microscopic level all types of stored energy should be related to the fact that certain degrees of freedom are not in the thermal equilibrium with the environment [47]. Indeed, if all degrees of freedom would be thermalized at the same temperature, the second law will not allow any work-extraction 1. It is known that frequently such non-thermalized degrees of freedom can be described by different effective temperatures [48]. Moreover, even a finite non-equilibrium system can (under certain conditions) play the role of a thermal bath, since the dynamics of its subsystem obeys the detailed balance condition [49]. Thus when describing work-extraction from chemical energy, it is meaningful to assume two different thermal baths, which is in fact the simplest situation of a non-equilibrium system. Modeling work-extraction through different chemical potentials (a situation closer to the standard understanding of the stored chemical energy) is in fact structurally similar to heat-engines [50, 51], also because we work in the high-temperature limit, where many implementation details are irrelevant. In this limit our model is fully consistent with linear equilibrium thermodynamics [18, 52]. Similar models have been widely employed in bioenergetics for modeling coupled chemical reactions, where the passage of heat from higher to lower temperatures corresponds to the down-hill reaction, whereas work extraction corresponds to the up-hill reaction [52].
III. COMPETITION FOR FIXED CURRENT
Two agents 1 and 2 competing for the same resource can be described as two heat engines attached to the same thermal baths [cf. (9)]. The sum of the currents coming from the high-temperature bath Th to each heat engine is bound from above [cf. (6)]:
| (11) |
where A is a positive constant, and J1 h and J2 h are the currents coming from the same Th-bath to each heat engine; cf. (6).
Using (9, 5) we write constraint (11) as
| (12) |
| (13) |
Here E3 k is the third (highest) energy level of agent k = 1, 2, ϵ is the ratio of energy scales of the engines that also determines the difference between 1 and 2. For ϵ = 1, the two agents are equivalent, i.e. the situation is symmetric.
We introduce scaled efficiencies xk = ηk/ηC ∈ [0, 1] for agents k = 1 and k = 2. Then the work extracted by an agent reads
| (14) |
| (15) |
For a in (13) we assume
| (16) |
The first inequality in (16) ensures that the agents cannot simultaneously maximize 𝓦1 and 𝓦2 at η1 = η2 = (1 − ϑ)/2 (i.e. at x1 = x2 = 1/2), which refers to the maximum power regime (10) for both agents.
The second inequality in (16) means that (12) still allows some x1 < 1 and x2 < 1. Thus, Eq. (11) creates a non-trivial competition between the agents, which translates into the following game theoretical problem
| (17) |
| (18) |
where xk and 𝓦k are the actions and payoffs of the agents, respectively.
Thus, the present game has two dimensionless parameters: a and ϵ. Now a > 1/2 means that the constraint is non-trivial, while ϵ defines the payoff asymmetry between 1 and 2; cf. (15).
Game theory offers several concepts of equilibrium that formalize the ill-defined—due to (18)—notion of “joint” maximization for the utilities 𝓦1 and 𝓦2 [54-56]. For the game considered here, only the equilibrium concepts of Stackelberg and Pareto-Nash are non-trivial, and these are examined below.
A. Stackelberg’s equilibrium
Stackelberg’s competition model [53, 55, 57] is a sequential solution of the above game: the first agent (1) is the leader, since it has the advantage of the first move. The second agent (2) is the follower that responds to the first move by 1. Hence 1 chooses x1 = 1/2, which is the unconstrained maximum of 𝓦1. The best response of 2 to this action reads from the first inequality in (18)
| (19) |
where the last inequality amounts to
| (20) |
There are cases where (20) does not hold even though the second inequality in (18) still holds. In such cases, no action by 2 is possible because x2 > 1. Hence the co-existence of two agents is impossible. Satisfying (20) naturally becomes harder for a → 1 and easier for larger ϵ so that (20) always holds for ϵ → ∞. Thus, the follower tends to have a sufficiently large ϵ because otherwise, it is eliminated from the competition. Whenever (20) holds, 2 operates with a higher efficiency than 1, that is, invaders are more efficient [70, 71]. For ϵ < 1 the work extracted by 2 is smaller than the work extracted by 1: 𝓦2 < 𝓦1 = 1/4; cf. (17). However, 𝓦2 > 𝓦1 for ϵ > 1 and a sufficiently small a, that is, the second agent can extract a larger amount of work at a higher efficiency; cf. (17, (19).
B. Pareto line, Nash equilibria and evolutionary stability
A pair is a Pareto equilibrium of the game if for any allowed and it holds [54-56]
| (21) |
| (22) |
| (23) |
| (24) |
Eqs. (21-24) mean that if a change from benefits one agent, it implies losses for another agent. Now is an equilibrium, as far as there is a certain cooperation between the agents preventing any move or [54-56]. This contrasts the above Stackelberg’s solution, where agent 1 chooses the optimal condition for itself, irrespective of what happens to 2.
Eqs. (17, 18) show how to find the set of Pareto equilibria (i.e. the Pareto line). Note that both 𝓦1 = x1(1 − x1) and 𝓦2 = ϵx2 (1 − x2) achieve their absolute maxima at (resp.) x1 = 1/2 and x2 = 1/2. Hence the Pareto line is the line x1 + ϵx2 = a(1 + ϵ) possibly bound by lines x1 = 1/2 and x2 = 1/2:
| (25) |
| (26) |
Examples of Pareto lines (25, 26) are presented in Fig. 2 for different values of model parameters.
FIG. 2: The Pareto line (25, 26) of the game (17, 18) for different values of a, ϵ.
Blue (lower straight) line is x1 + ϵx2 = a(1 + ϵ) with a = 0.6, ϵ = 2. The allowed states are located above this line still confined by the square [0, 1] × [0, 1]. The Pareto line is the fraction [A, B] of this line that lies between intersections with x1 = 1/2 and x2 = 1/2; cf. (26). The worst outcomes for the players—defined by (28, 29, 30)—are and .
Red (upper straight) line is x1 + ϵx2 = a(1 + ϵ) with a = 0.8, ϵ = 0.8. Now the whole red line is the Pareto line. The worse outcomes for the players are .
Note that there is a continuum of Pareto equilibria so that additional reasoning is needed to select a unique outcome. Moreover, for this game, all Pareto equilibria are also Nash equilibria.
Let us discuss this point in more detail, and also outline relations with evolutionary stable strategies. Recall that a Nash equilibrium of the game with payoffs Wk(x1, x2) and strategies x1 and x2 for each agent k = 1, 2 is a pair of strategies such that for any allowed and
| (27) |
Eqs. (27) mean that for each agent any unilaterial deviation from is not beneficial. If the inequalities in (27) are strict, the Nash equilibrium is said to be strict. Note that in our problem the payoff of the players depend on the opponent’s strategy via constraint (18). For 1 the best response strategy to the given strategy x2 of 2 reads: if 1/ϵ (a(1 + ϵ) − 1/2) ≤ x2 ≤ 1; otherwise: . Similar relation holds for 2. The intersection of curves and coincides with Pareto line (25, 26).
The Nash equilibria in our situation are strict; hence they are also evolutionary stable [61], since no mutant strategy x1 can obtain higher payoff than the resident strategy .
Note that a refinement of evolutionary stability for continuous strategies—known as continuously stable states [61]—cannot be applied to our situation. Indeed, the refinement demands from a strict Nash equilibrium to be also a local maximum of the payoffs. However, the notion of a local maximum cannot apply to our situation, since the payoff functions are discontinuous on the Pareto line [60], i.e. they are not defined bellow the Pareto line in Fig. 2. Further consideration of evolutionary stability demands implementing mixed strategies which goes out of our present scope.
C. Worst outcomes
For any game it is relevant to know what worst thing agent 2 can do for 1, at any expense for 2. Besides describing hostility, worst outcomes will be relevant for bargaining solutions below. The worst outcome for agent k = 1, 2 read:
| (28) |
To find out , note from (18) that any choice x1 of 1 should hold x1 ≥ a(1 + ϵ) − ϵx2. If 2 arranged its action x2 such that a(1 + ϵ) − ϵx2 ≥ 1, then there is no choice to be made by 1. For such cases we shall define its utility 𝓦1 = 0, and this is clearly the worst outcome for 1. Now a(1 + ϵ) − ϵx2 ≥ 1 or equivalently [a(1 + ϵ) − 1]/ϵ ≥ x2 will work provided that a(1 + ϵ) − 1 ≥ 0. Hence, if the latter condition holds we get . Otherwise, if a(1 + ϵ) − 1 < 0, then 2 achieves minx2[𝓦1] when the bound x1 ≥ a(1 + ϵ) − ϵx2 is possibly tight, i.e. for x2 = 0, and then x1 = a(1 + ϵ) = argmaxx1 minx2 [𝓦1]. Calculating in the same way we get
| (29) |
| (30) |
Note that : (29, 30) imply if , which is impossible since .
The fact of e.g. is important, since it means that 1 cannot be eliminated from the competition. Note that (29, 30) are not simply the separate global minima of (𝓦1, 𝓦2), i.e. generally . Still we get for ϵ = 1.
D. Fair allocation and the Nash bargaining solution
Bargaining solutions assume additional cooperativity between the agents—that relates to fair division of resources—thereby looking for a unique choice within the Pareto line (25, 26) [54-56, 59]. We shall work with the Nash bargaining solution [54-56, 59]. An advantage of this over other bargaining solutions is the consistency of underlying axioms with thermodynamic processes [58].
The Nash bargaining solution is found from (29, 30) by maximizing the geometric mean of and [56, 58, 59]:
| (31) |
The meaning of (31) is that both 1 and 2 (in a sense) simultaneously maximize their increments over the worst outcomes. Note that only one of and can be non-zero, that is, one of the agents will be competed out in the worst cases; see Appendix B.
Eq. (31) implies that for ϵ = 1 (the same energy scales) the maximizers and of (31) hold . Thus, the resources are allocated evenly between equivalent agents. More generally, (31) predicts and k = 1, 2, that is, none of the agents is eliminated, and none of them operates at the maximum power. Another implication of (31) is that
| (32) |
i.e. for ϵ < 1, 2 extracts less work than 1; see Fig. 3, where 𝓦1 − 𝓦2 is presented for various ϵ and a. It is seen that the difference in the obtained resources of the agents increases with the available resource level (i.e. with decreasing a, cf. (18)). As shown by numerical solution of (31), no simple relation exists for ; see Fig.3. Indeed, acquiring more resource is possible either with higher efficiency level (left-top red region) or low efficiency level (left-bottom blue region).
FIG. 3: The Nash bargaining solution for the game (17, 18).
(a): 𝓦1 − 𝓦2 obtained from (31) for different a and ϵ. Red (blue) regions refer to 𝓦1 − 𝓦2 > 0 [𝓦2 − 𝓦1 > 0]. Darker red (darker blue) colors refer to larger 𝓦1 − 𝓦2 [larger 𝓦2 − 𝓦1]. Lines represent contours of fixed values for 𝓦1 − 𝓦2.
(b): for different a and ϵ, where are found from argmax in (31). Colors follow the above logic.
E. Outlook
Plants compete for sunlight. They evolved various strategies for that: horizontal growth, vertical growth, shade tolerance etc) [63-73]. Photosynthes is a heat engine operating between two thermal baths, the hot photon bath generated by the Sun and the cold bath of the ambient environment. The sunlight is not an exhaustible source (in contrast to food), but plants growing on the same territory can appropriate different portions of the sunlight current depending on their competing abilities and strategies.
We can thus apply the heat-engine model to a single plant. The competition between two plants (i.e. two heat-engines) will be described via (11-15). Now (11, 18) correspond to constraints that determine the competition between two plants (agents): a larger a signifies stronger competition, while the minimal value a → 1/2 of a means no competition. Once ϵ determines the agents asymmetry between the work-currents [cf. (15)], it is natural to relate ϵ to the ratio of the full active leaf area of each plant: ϵ > 1 means that for agent 2 this area is larger.
Plants can have two basic competition strategies:
– (1) direct competition, where 1 (leader) is the native plant, and 2 (follower) invades the already established canopy.
–(2) Avoidance of competition, where some compromise on resource sharing is reached. This regime can refer to the fair solution discussed in (25-32).
Now (1) corresponds to Stackelberg’s solution (19). As follows from (20), the invader survives if its active leaf area is larger than that of the native plant (ϵ > 1), and it operates with a higher efficiency. Both these features are compatible with the observations on plant competition. Examples of invaders that hold these features are Ligustrum Sinense [67], and Schinus molle [68]. Also, there is a general agreement that a high resource environment (lower a) is more vulnerable to invasion than a lower resource one [63, 69-71], which is compatible with (19, 20). Note that more general models should, in particular, account for the fact that the payoff 𝓦k of the agents in the competition non-linearly depend on the ratio of energy scales ϵ. Indeed, generally the light obtained by a plant does not depend linearly on its size [63-65, 67].
Another application of this formalism relates to resource sharing in a multi-cellular organisms, where different cells rely on constant nutrients supply. This assumes cooperation between cells and fair allocation of nutrients based on the resource demand of the cells; hence (2) applies. The expensive germ line hypothesis is an example of resource allocation problem between germ and somatic cells in multicellular organisms [39-42]. Germ cells require more nutrients, resulting in a clearly asymmetrical situation as in (17, 18). On the other hand, the lack of resources for somatic cells could cause senescence [41]. Hence, the resource allocation between germ and somatic cells can be considered as a bargaining process over the available resources. Under a shortage of resources, the organism would distribute the resources between germ and somatic cells more evenly, as can be seen in Fig.3, where ∣𝓦1 − 𝓦2∣ becomes smaller for larger a.
IV. COMPETITION FOR DEPLETABLE RESOURCES
A. Resource depletion
Above we assumed that the thermal baths are very large, hence their temperatures do not change during the heat engine functioning. Now we focus on finite baths that are still large enough so that their temperature(s) changes on time scales that are much longer than the relaxation time of the engine. Because the hotter bath is the source, we shall assume that its temperature Th decreases as the result of the work-extraction. For simplicity, the other two baths will be held at a constant temperature.
Eq. (9) shows that once Th decreases and gets closer to Tc the heat engine will stop functioning at 1 − ϑ = E2/E3. This is still compatible with ϑ < 1, i.e. the baths are still not in equilibrium, but this non-equilibrium cannot be employed by the engine for work-extraction.
The slow dynamics of Th can be deduced from the formula of equilibrium heat-capacity, which governs the change of temperature for a body at equilibrium given the change of its internal energy [44] [cf. [35, 36]]:
| (33) |
where Ch is the constant heat capacity of the bath. Indeed, Ch is large, since it scales with the number of the bath degrees of freedom. On the other hand, Jh ~ 1, hence is small, i.e. Th changes slowly.
B. The setup
For simplicity, we assume that E3 and ρ are the same for both agents: E3 1 = E3 2 = E3 and ρ1 = ρ2; cf. (9). Under these assumptions, the dimensionless power of each agent (k = 1, 2) and the time evolution of temperature ratio ϑ are given by [cf. (9, 8)]
| (34) |
| (35) |
where (35) is found from (33), ηk are efficiencies of agents, and is a constant. We will compare different types of heat engines that employ the same depletable source and compete with each other aiming to increase the stored energy (= extracted work). The heat engine power (34) and its time-integral (stored energy) describe (resp.) short- and long-term advantages of the agent.
We distinguish adaptive and non-adaptive agents in the competition for the same depletable source. The internal structure (difference in energy levels) of non-adaptive agents remains fixed during the time, i.e ηk = const, k = 1, 2 are time-independent. In contrast, adaptive agents continuously tune their time dependent efficiency to the source depletion:
| (36) |
The rationale of choosing the specific adaptation protocol (36) comes from the locally equilibrium thermodynamics; e.g. αk = 1/2 describes the local maximum power according to (10). Appendix C discusses in detail the energy stored by a single agent (adaptive or not).
Adaptation to environmental changes (phenotypic flexibility) is an intrinsic capacity of all organisms [28, 77-81, 94-97]. Adaptive processes are also involved in niche-construction [98-100], where competing agents shape the selection process by altering their own and competitors’ environment. In view of (36) we focus on myopic adaptation, where agents adapt to environmental changes caused by their activity, but do not shape the environment beforehand (that is, do not construct new niches).
We emphasize that (36) is mostly a phenomenological description of adaptation processes. We do not explore internal mechanisms of adaptation, and in particular, we do not account for the full energy cost of maintaining the adaptation; see [37] in this context. The main advantage of (36) (to be explained below) is that allows the heat engine to work till the final depletion of the source, i.e. till θ → 1; cf. the paragraph before (33).
C. Competition between non-adaptive agents
Let us now consider competition of two non-adaptive agents (1 and 2) attached to the same baths. We can take without loss of generality
| (37) |
Hence 2 will have a finite working time determined from (35) via η2 = 1 − ϑ(τ2). We assume that for t > τ2, 2 is detached from the baths. Otherwise, this agent will not function as a heat engine. Hence the competition for the same source brings in a finite consumption time of functioning as heat engine.
For a given initial value ϑ(0) = ϑ0, define the stored energies for (resp.) 1 and 2:
| (38) |
| (39) |
W1(η1, η2) in (38) is composed of two terms. The first term in (38) is the stored energy for time interval τ2, i.e. during the competition. The second term is evaluated once 2 stops work-extraction and 1 operates alone; see Appendix D. Note that the functions and are different due to (35) and the finite working time τ2. is given by (34), and found from (35) where evaluation starts at ϑ(τ2) = 1 − η2.
For a single agent in the absence of competition, the optimal efficiency of the heat engine is determined from maximizing the stored energy (time-integrated power) [see Appendix C]
| (40) |
Eq. (40) is formally similar to the Curzon-Ahlborn efficiency of heat engines [16-18], but the meaning of (40) is different, because (40) follows from the consideration of the full extracted work from a finite bath. We emphasize that (for a single non-adaptive agent) the optimal efficiency (40) is strictly smaller than the maximal (Carnot) efficiency 1 − ϑ0; cf. (8).
Fig.4 shows the competition results described by (38, 39) for two non-adaptive agents 1 and 2. The energy-storing strategies of each agent are parametrized by their efficiencies η1 and η2; recall (37). It is seen that when competing with the efficiency at the maximal initial power [cf. (10)], it is never meaningful for 2 to employ a larger efficiency, i.e. wins over all ; see Fig.4. In particular, is not anymore optimal in contrast to the single-agent case (40). However, when competing against it is beneficial for 1 to employ certain smaller efficiencies .
FIG. 4:
Two non-adaptive agents 1 and 2 compete for a depletable source. Their stored energies (payoffs) are given (resp.) by (38) and (39), while the work-extracting strategies are parametrized by efficiencies η1 and η2. According to (8, 37), η1 and η2 hold 1 − θ0 > η2 > η1. Three situations are shown on the same graph. For ϑ0 = 0.25 blue-dotted lines delimit the domain 0.75 > η2 > η1 > 0, while the blue-shadowed domain indicates on W2(η1, η2) > W1(η1, η2). The rest of the domain delimited by blue-dotted curves refers to W2(η1, η2) < W1(η1, η2). Likewise, green and red lines and shadowed domains refer to ϑ0 = 0.5 and ϑ0 = 0.25, respectively. Thus on curves a, b and c we have W2(η1, η2) = W1(η1, η2) for (resp.) ϑ0 = 0.25, 0.5, 0.75. The blue, red, green points refer to and for (resp.) ϑ0 = 0.25, 0.5, 0.75. The blue, red, green rectangles illustrate the outcomes of emergent prisoner’s dilemma subgame (43– 48) obtained through the best response analysis of the stored energies (38, 39).
D. Best responses and emergent prisoner’s dilemma
1. Best response cycle
Once W1 and W2 in (38, 39) depend on both η1 and η2, the notion of the optimal efficiency is to be studied via game theory. Here the stored energies W1 (η1, η2) and W2 (η1, η2) are payoff functions of (resp.) agents 1 and 2, and the choice of η1 and η2 (strategy profile) refers to their actions.
We can determine the best responses of this game that eventually leads to Nash equilibrium. Starting from an arbitrarily initial point an iterative process of successive best responses is defined recalling (38, 39) and (37):
| (41) |
| (42) |
This iteration converges to a 4-cycle that is independent from the initial point :
| (43) |
| (44) |
where means that is the best response to . Hence the agents can apply the following pairs of strategies:
| (45) |
We emphasize that the efficiency values in (44) depend only on ϑ0 and on the initial assumption η2 ≥ η1 in (38, 39). For example, at ϑ0 = 0.5 we have , , and . Hence, the long-time best response in this game is not a unique Nash equilibrium, but rather the 4-cycle (44). For ϑ0 → 0 (potentially large resources), we get that all efficiencies in (44) converge to 2/3. In contrast, for ϑ0 → 1 they all converge to zero. Fig. 4 shows the location of (45) on the (η1 ≤ η2) plane for several values of ϑ0; see rectangles in Fig. 4. It is seen that they are close to the η1 = η2 line.
Hence, the best response dynamics converges to the limit cycle. This is an asymptomatically stable state, because any deviation converges back to the cycle. Note that the above sequential best response algorithm does not ensure that all Nash equilibria of the game are found. Indeed, cycles of best response are widely observed in generic two-player dynamic games [62]. For adaptive agents studied below, there is no limit cycle due to the existence of global maxima of the payoff functions.
2. Emergent prisoner’s dilemma
Now note that
| (46) |
where (η1, η2) assumes any of 4 (i.e. any of the allowed) pair in (45). Eq. (46) is non-trivial, because it shows that the joint application of the best response strategies (44) loses to the joint application of the efficiency that is optimal for a single agent; cf. (40).
However, this feature of is unstable due to
| (47) |
| (48) |
Eq. (47) means that switching from to or is beneficial for 1. Now 2 responds to this switch in the best way, and the agents find themselves within actions (45) that are worst than due to (46).
This situation does resemble the prisoner’s dilemma [31, 34, 53, 55, 82], where the agents following by best response strategies end up in the worse situation compared with the cooperative behavior, which for our case refers to (46). We emphasize that, in contrast to the standard prisoner’s dilemma, here the best-response strategies are not unique and amount to 4-cycle (43); see rectangles in Fig.4. Note that the 4-cycle also contains a step with equal efficiencies .
E. Adaptive agents
1. Competition between two adaptive agents
Let us consider now the competition between adaptive heat engines (36). In the absence of competition (i.e. for a single agent), the optimal value of α in (36) obtained from maximizing the total stored energy (extracted work) is close to the maximal Carnot efficiency (8), i.e. α → 1 in (36); see Appendix C. Indeed, the single adaptive agent (36) works as long as ϑ ≠ 1 (till the full depletion of the resource). Hence α → 1 minimizes the losses in the energy storage process. Note the difference with (40), where the optimal efficiency for the non-adaptive agent was strictly smaller than the Carnot efficiency. In this sense the usual premise of thermodynamics that the maximal efficiencies would be useful applies to the (single) adaptive agent, which implies advanced functioning mechanisms, as compared to the non-adaptive situation.
For the competition of two adaptive agents 1 and 2 with α1 and α2 in (36), we get for stored energies [see Appendix D]
| (49) |
where ϑ0 is the initial ratio of temperatures, and . Eq. (49) shows that the agent with αk closer to 1/2 wins. Now (36) and (10) imply that αk = 1/2 is (locally) the maximum power regime. Note that similar maximum-power regimes were proposed as an operating principle for ecological and biological systems (including living organisms) [3]. Here, we see that the optimality of maximum-power regime is closely related to the adaptation abilities of agents.
Adaptive agents do face the prisoner’s dilemma, but the situation here is simpler than that discussed in (43-(48). If α1 = α2 → 1, then in (49), and each one will store more energy compared to the case when both operate at the maximum power αk = 1/2. This situation, again, is unstable: if, for example, the first agent switches to the maximum power regime α1 = 1/2 (with the second agent working at α2 → 1), then the first agent will store more energy.
2. Adaptive agent competing with a non-adaptive one
Now we discuss the competition of an adaptive agent (36) against an agent with a time-independent efficiency η. Appendix D studies this situation in detail. Here we summarize the main results.
The adaptive engine operates till the full depletion of the resource. The non-adaptive agent will operate during a finite time τ (see Appendix D for details), i.e. the adaptive agent will still function alone for t > τ. Fig. 5 shows the competition results: the adaptive agent wins whenever it competes against sufficiently small or large values of η. Whenever η is around the initial maximal power value (1 − ϑ0)/2 [cf. (10)], also α should be around the adaptive maximal power regime α = 1/2 for adaptive agent to win. In particular, wins against any η, while α → 1 (i.e. the optimal α for a single agent without competition) looses to any η; see Fig.5. However, as Fig.6 shows, is not the value, where the stored energy of the adaptive agent is maximized.
FIG. 5:
Competition between adaptive agent with efficiency ηad = α(1 − ϑ(t)) for α ∈ [0, 1] and a non-adaptive agent with fixed efficiency η ∈ [0, 1 − ϑ0]; cf. (36, 8). The initial temperature ratios are ϑ0 = 0.1 (blue) and ϑ0 = 0.25 (red). In shaded regions the adaptive agent extracts more total work than the fixed-η agent. Dotted curves show the line for ϑ0 = 0.1 (blue) and ϑ0 = 0.25 (red).
FIG. 6:
Competition between adaptive agent with parameter α in (36) and a non-adaptive agent with a fixed efficiency η. The figure shows stored energies—blue and red for (resp.) adaptive and non-adaptive agents—versus α for a fixed η. Full (dashed) curves refer to , where the initial temperature ratio ϑ0 = 0.1 and μ = 1 in (35).
Recall that and α ≈ 1 determine the optimal efficiencies for the single (without competition) non-adaptive and adaptive agent, respectively. Fig.5 shows that if these values are kept under competition, then the non-adaptive agent wins.
F. Outlook
ATP stores energy: it is metastable at physiological conditions, and has the half life-time of several hours, after which it hydrolyses spontaneously dissipating the stored energy. Thermodynamically, ATP is similar to a high-temperature bath that stores energy at temperature T [47, 48]; cf. the heat-engine model (1-10).
ATP production in cells is an example of energy extraction and storage that occurs via fixed biochemical pathways [85]. Two such pathways of ATP production from glucose are well-known: fermentative (aerobic and anaerobic) and oxidative respiration. There is a form of power-efficiency dilemma here: oxidation pathway results to 18 times greater ATP molecules than by fermentation path, while the fermentation pathway is faster (100 times in muscle cells) and hence is more powerful [76, 83, 85]. These pathways have been extensively studied in competitive environments, both in yeasts [31, 32, 34, 83, 84, 86] and in solid tumor cells [90-93].
We suggest that ATP production paths refer to agents with different internal structure that eventually determines the difference in power and the overall stored energy. Our results in section IV C mean that competition favors lower efficiencies. This is indeed observed in yeasts and tumor cells [31, 32, 34, 83, 84, 86] and also in solid tumor cells [90-93]. Whether also higher powers are favored in our model is a more convoluted question. It does have a straightforward positive answer for the adaptive situation, where the maximal power regime is well-defined at all times.
Note that whenever the resources are shared under a centralized control—i.e. the competion is eliminated and we effectively have a single agent—the optimal efficiency is higher than the efficiency at the maximum power; see (40) and (10). Here the optimal efficiency is defined as the maximizer of the total extracted work. This finding can explain why the oxidative respiration is effectively ubiquitous in multicellular organisms, whereas when the control is lost (e.g. in tumors) cells may switch to fermentative ATP production.
Ref. [31] developed a dynamic model for several populations feeding on a common infinite (i.e. constantly renewable) source. The resource extracted by a population is equalized to its growth per capita. While the model does not hold the laws of thermodynamics (e.g. nothing prevents larger than one efficiencies), it demonstrates the exclusion rule: only the population with the largest power of extraction survives. The size of the survived population is determined by the efficiency of resource extraction. But the proper power-efficiency trade-off is absent [cf. section II], the populations anyhow strive for largest efficiency. It was proposed that this exclusion rule can apply to evolution of ATP production pathways and explain the emergence of multi-cellular organisms: alike cells gather together, thereby exclude competitors and develop less powerful, but more efficient mechanisms of resource extraction [31]. Some results of Ref. [31], e.g. on the relevance of power during the competition, broadly agree with our analysis. However, since our model explicitly agrees with thermodynamics, it provides a richer perspective, e.g. the power uniquely determines the competition outcome only for the adaptive situation. Also, the strict exclusion rule need not hold.
The standard prisoner’s dilemma game was also discussed in the context of agents operating by different ATP production paths [34, 83]. In our situation, an effective sub-game that resembles (but is more complicated than) the prisoner’s dilemma emerges out of the thermodynamic competition between two agents. It is likely that an effective prisoner’s dilemma situation is a reduced description for a more general class of models for agents exploiting the same finite source in accordance with the laws of thermodynamics. We shall explore this hypothesis elsewhere.
V. SUMMARY
The metabolism of living organisms obeys the laws of thermodynamics, while the energy extraction and storage mechanisms are subject to evolutionary pressure [12]. In this work we studied competition between energy-extracting thermodynamic agents modeled as heat engines.
Efficiency and power are two main characteristics of heat engines. They are complementary to each other [16–20]. This known power-efficiency tradeoff [16-20] is also observed in various biological systems [24-34]. The biological situation is however fundamentally more complex, since it involves adaptation to the source, various channels of using the extracted energy, their mutual feedback etc.
Models describing these processes in detail are yet to be developed. Here we treat competing agents as heat engines that extract energy (work) from the available source (high-temperature bath). We focus on the linear thermodynamic regime, where the implementation details of the engine are not essential. Two general scenarios for selection are considered: competition for a fixed energy current (the source is then effectively infinite) and for a depletable (finite) source. In both cases, agents compete indirectly, i.e. the interaction between agents is resource mediated. The optimization targets for these scenarios are (resp.) the power (= energy extracted per time unit) and the stored (= total extracted) energy. Both are relevant biologically.
Competition for the fixed energy current is considered under two known set-ups of game theory: Stackelberg equilibrium [53, 55, 57] and Pareto optimality [54-56]. Once photosynthesis is a heat engine operating between a hot thermal bath (photons generated by Sun) and the cold thermal bath (Earth environment), these set-ups are relevant for plants competing for light. We show that Stackelberg’s set-up reproduces features observed in invading plants [63-73]. Pareto optimality assumes weak cooperative behavior of agents, where the resource allocation is fair and is based on the demands of agents. We implemented the Nash bargaining program [54-56], as a solution for the fair allocation problem. We observed that the optimal efficiencies of competing agents are higher than the optimal efficiency of a single agent (without competition). This is in stark contrast to what happens in the competition for a finite (depletable) source; see below. The proposed scenario of fair resource allocation is relevant for multicellular organisms, e.g. the allocation between germline and soma cells [39-42].
We further examine adaptive and non-adaptive agents competing for depletable resources (finite high-temperature thermal bath). The difference between the two types of agents is the ability of the former to alter its internal structure adjusting it to changes in the exploitable source. Properties of the non-adaptive agent remain fixed in time. The fermentative and oxidative ATP production in cells, which are among the most fundamental, universal biochemical pathways in all life forms, are examples of energy production processes that proceed along fixed pathways [85]. When these alternative pathways of ATP production are considered as competing agents, the adaptation ability of agents is directly related to the phenotypic adaptation of organisms [28, 77-81, 94-97].
The optimal efficiencies of the work-extraction process differ for adaptive and non-adaptive agents. The adaptive agent wins over other agents if its efficiency is that of the maximal power. This however not the efficiency value that maximizes the stored energy in case of competition with non-adaptive agent. The optimal efficiency for the agent with a fixed structure has non-unique (and richer) optimal values. One reason behind this difference is that adaptive agents compete until the final depletion of the sources, whereas non-adaptive agents are unable to do so. We stress that our treatment of adaptation is to some extent formal, since it so far does not account for its full energy cost [37]: future models should show how to direct a part of the stored energy to the needs of adaptation [38].
One general outcome of our model is that agents competing for stored energy face analogues of the classical prisoner’s dilemma [31, 34, 53, 55, 82]. Here the cooperative behavior refers to efficiencies that are optimal in the absence of competition (i.e. for a single agent). The mutually cooperative behavior is still beneficial under competition, but it is not stable, when one agent employs a smaller efficiency (defection). This puts the agents in the best response cycle, where they store a smaller amount of energy. It is possible that prisoner’s dilemma is a general consequence of thermodynamic laws applied to exploitation of a depletable resource.
Overall, we found that simple models with no special assumption beyond the laws of thermodynamics can recapitulate certain features of biological evolution.
Acknowledgments
This research was supported by the Intramural Research Program of the National Library of Medicine at the NIH. A.E.A was supported by SCS of Armenia, grants No. 21AG-1C038 and No. 20TTAT-QTa003. A.E.A. was partially supported by a research grant from the Yervant Terzian Armenian National Science and Education Fund (ANSEF) based in New York, USA. We thank the late Guenter Mahler for discussions on thermodynamics of evolution.
Appendix A: Markov model for heat engine: Structure, Power and Efficiency.
Here we use slightly different notations as compared to the main text. The notations are related as follows, ρi←j ≡ ρij, and
Let us recall the simplest model of heat engine [37]. The model has three states i = 1, 2, 3, which is the minimal number of states a stationary operating heat engine can have, because it ought to be in a non-equilibrium state (i.e. to support one cyclic motion), and because it has to support three external objects: one work-source and two thermal baths. Each state i has energy Ei. Transitions between different states are caused by thermal baths that can provide or accept necessary energies. We assume that the resulting dynamics is described by a Markov master equation
| (A1) |
where pi is the probability to find the system in state i at time t, and where ρij is the transition rate from state j to state i.
We assume that each pair of transitions i ← j and j ← i is caused by thermal baths Tij = Tji = 1/βji, which are in thermal equilibrium (this point will be clarified latter, when we discuss temperature changing process). The equilibrium nature of thermal baths impose detailed balance condition on the transitions [43]
| (A2) |
We take one temperature infinite [37]: β21 = 0. This bath is then a work-source. This important point is explained via the following related arguments. First, note that an infinite temperature thermal bath exchanges energy without altering its own entropy, which is a feature of mechanical device (i.e. sources of work) [37]. Indeed, due to the equilibrium thermodynamic relation dS21 = β21dQ21 = 0 it exchanges energy dQ21 ≠ 0 at zero entropy change dS21 = 0. Second, if an infinite temperature thermal bath interacts to any (positive) temperature bath, then the former bath always looses energy indicating on no additional costs for the transfer of energy that was stored at an infinite temperature. In that sense, the latter energy is freely convertible to any form of heat, as expected from work.
For the thermal bath with temperature Tij, we define Jij as the average energy lost by the bath (for Jij > 0) or gain (Jij < 0) per unit of time. Since each bath causes only one pair of transition we get
| (A3) |
In the stationary (but generally non-equilibrium) state the average energy of the three-levels system is constant:
| (A4) |
and hence we get from (A2, A3, A4) that the sum of energy current nullifies:
| (A5) |
which is the first law of thermodynamics in the stationary state [44].
The stationary probabilities pi are found from (A1):
| (A6) |
where i ≠ j ≠ k, i, j, k = 1, 2, 3.
Using (A2,A3,A6) and noting that ρ12 = ρ21 due to the assumed condition β12 = β21 = 0, we obtain for the average energy currents
| (A7) |
| (A8) |
| (A9) |
The heat engine functioning is defined as
| (A10) |
i.e. the infinite-temperature bath gains energy. Eq. (A7) implies that for the heat engine functioning it is necessary that
| (A11) |
We will assume that ϑ < 1, that is to say T31 (T32) is the temperature of the hot (cold) bath. Eq.(A11) shows that the system will operate as a heat engine for a given ϑ < 1 if .
The efficiency of any heat engine is defined as the result (i.e. the extracted work) divided over the resource (i.e. the energy coming from the hot bath). Under ϑ < 1 and the efficiency η amounts to
| (A12) |
i.e. the efficiency is bounded from the above by the Carnot efficiency ηC. Hence, (A12) is the general message of the second law for the heat engine efficiency [44].
Another important message of (A12,A11,A7) is the power-efficiency trade-off: at the maximal efficiency the power −J12 of the heat engine nullifies. This trade-off is also a general feature of heat engines [19], though it is frequently missed in phenomenological treatments of equilibrium thermodynamics [44]. This trade-off is one pertinent reason for having an explicit microscopic model of a heat engine [19].
a. High temperature regime.
The work power J12 in (A7) depends on the specific form of the transition rates ρij that enter the detailed balance condition (A2). The form of ρij depends on the physical implementation of the model; e.g. the Arrhenius form of ρij applies in chemical reaction dynamics [43]. We shall work in the high-temperature limit, where the details of ρij are not important provided that it holds the detailed balance (A2). Now β32 ≪ 1 and β31 ≪ 1 are sufficiently small, i.e. Eiβ32 ≪ 1 and Eiβ31 ≪ 1, but 0 ≤ ϑ ≤ 1 in (A11) can be arbitrary. In this limit the power of heat engine reads from (A7):
| (A13) |
where we denoted . We shall assume that ρ is a constant. Note that for β31 = β32 = 0 we get ρij = ρji from (A2). Hence in this limit we get pi = 1/3; cf. (A6). Now J21 ≠ 0 in (A13) due to small deviations of pi from 1/3 that exist due to a small but finite β31 and β32 = 0.
In the high-temperature regime our model is fully consistent with linear equilibrium thermodynamics [18, 52]. Structurally similar models were widely employed in bioenergetics for modeling coupled chemical reactions, where the passage of heat from higher to lower temperatures corresponds to the down-hill reaction, where the work-extraction refers to the up-hill reaction [52]. However, we emphasize that a microscopic model clarifies the status of the involved parameters, as well as demonstrates explicitly that the extracted work relates to the stored energy. These two important aspects are not clear within the phenomenological introduction of linear thermodynamics models.
Appendix B: Competition for a fixed energy current
Two agents competing for fixed resources can be described as two heat engines [cf. (A13)]:
| (B1) |
| (B2) |
where are the efficiencies [cf. (8)]. Note that for the photosynthesis θ ≃ 10−3 is a small number.
We write this constraint (11)as
| (B3) |
| (B4) |
where A′ and A″ are positive constants.
a. Game-theoretical approach: actions and utilities
| (B5) |
To simplify the parametrization of the problem we denote
| (B6) |
and write the extracted works as
| (B7) |
| (B8) |
The problem described by (B5-B8) has a game-theoretic interpretation, where 𝓦1 and 𝓦2 refer to utilities of each agent, and (B5) ensures that their actions x1 and x2 are coupled. Hence, for A in (B5) we shall assume
| (B9) |
The first inequality here means that the agent cannot simultaneously maximize 𝓦1 and 𝓦2 at η1 = η2 = (1 − ϑ)/2. The second inequality means that the constraint (B5) still allows some η1 < 1 − ϑ and η2 < 1 − ϑ. Introducing a new constant via A = a(1 + ϵ)(1 − ϑ) we conveniently write (B5) as
| (B10) |
where the last inequalities follow from (B9).
b. Pareto line and the bargaining equilibrium
Pareto’s concept of equilibrium assumes a certain level of passive cooperation between 1 and 2. Now 1 does not increase its utility 𝓦1 without decreasing the utility 𝓦2 of 2, and likewise 2 does not increase 𝓦2 without decreasing 𝓦1 [54-56]. This contrasts to Stackelberg’s solution, where the first moving agent chooses the optimal condition for itself, irrespective of what happens to another agent. Not surprisingly, Pareto’s concept does not provide any unique choice of (x1, x2), and an additional reasoning is needed for selecting a unique outcome among Pareto solutions.
It is seen from (B8, B10) that the set of Pareto equilibria (i.e. the Pareto line) is given by the following three conditions
| (B11) |
Note that there is a continuum of Pareto equilibria. Moreover, for this game all Pareto equilibria are also Nash equilibria. Indeed, recall that the Nash equilibrium looks for a pair such that is the best response to x2 (i.e. the conditional maximum of 𝓦1 given x2), while is the best response to x1 [54-56].
Nash’s bargaining solution is one approach for making a unique choice within (B11) [54-56, 59]. For defining the bargaining target we need to find the worst utility for each agents:
| (B12) |
To find out , note from (B10) that any choice x1 of 1 should hold x1 ≥ a(1 + ϵ) − ϵx2. If 2 arranged its action x2 such that a(1 + ϵ) − ϵx2 ≥ 1, then there is no choice to be made by 1. For such cases we shall define its utility 𝓦1 = 0, and this is clearly the worst outcome for 1. Now a(1 + ϵ) − ϵx2 ≥ 1 or equivalently [a(1 + ϵ) − 1]/ϵ ≥ x2 will work provided that a(1 + ϵ) − 1 ≥ 0. Hence, if the latter condition holds we get 𝓦1 = 0. Otherwise, if a(1 + ϵ) − 1 < 0, then 2 achieves minx2 [𝓦1] when the bound x1 ≥ a(1 + ϵ) − ϵx2 is possibly tight, i.e. for x2 = 0, and then x1 = a(1 + ϵ) = argmaxx1 minx2[𝓦1]. Altogether, we get
| (B13) |
is calculated in the same way:
| (B14) |
Note that only one among and can be non-zero. Indeed, from (B13) and (B14) it follows that , if which is impossible since . The fact of e.g. is important, since it means that 1 cannot be eliminated from the competition. Now (B13, B14) are not simply the separate global minima of (𝓦1, 𝓦2), i.e. generally , e.g. because generally is not reached for any action of 2. (Still we get for ϵ = 1.)
Now the Nash bargaining solution is found from maximizing the geometric mean of and [58]:
| (B15) |
where (x1, x2) should vary along all allowed values defined by x1 + ϵx2 ≥ a(1 + ϵ), 0 ≤ xk ≤ 1, and by the values corresponding the worst outcomes of the players . As it is mentioned above, it is possible that in the worst case 1 will not have a choice for the given action of 2. That is, the action of 1 will be lie outside of the allowed region. Thus, we obtain the following region of maximization for (B15), where the worst outcomes are also take an account.
| (B16) |
Note, that the last bounded region includes the Pareto line of the problem (B11). One may even maximize (B15) over the Pareto line, since the outcome of the maximization will lie on that line [58, 59].
Eq. (B15) maximizes a mean, since 1 and 2 are assumed to be equivalent, and it is the geometric mean, since generally dimensions of utilities 𝓦1 and 𝓦2 can be different. Ref. [58] discusses axioms of the bargaining solution and shows that they are consistent with thermodynamics.
Appendix C: Strategies of work-extraction. Depletable resources.
The dynamics of T31 can be deduced from the formula of equilibrium heat-capacity, which governs the change of temperature for an equilibrium body given the change of its internal energy [44]. Once the changes of T31 are assumed to be slow:
| (C1) |
where C = C31 is the heat capacity of thermal baths. Indeed, C is large, since it scales with the number of the bath degrees of freedom. On the other hand, J31 ~ 1, hence is small, i.e. the change of T31 is slow. We shall make a natural assumption that it is much slower than the relaxation of the engine to its stationary state (A6).
Taking into an account that β32 = const, we find for ϑ = β31/β32 from (A13, A8):
| (C2) |
Different strategies of work-extraction refer to differences in its structural parameters and .
a. Non-adaptive agent
Here we discuss the case, when the heat engine is adapted to the initial temperature ratio, i.e the efficiency η of the heat engine remains fixed in the course of time. Eq. (A13) shows that for a fixed ϑ the maximum power of heat engine (i.e. the maximum of ∣J21∣) is attained for
| (C3) |
Eq. (C3) implies that the maximal (Carnot) efficiency is attained for . Note that in various comparisons between different engine structures it is sensible to keep the difference fixed and vary only . Indeed, in the considered regime , is the difference between the maximal and minimal energies of the heat engine. Hence fixing means to fix the global energy scale.
| (C4) |
| (C5) |
For a constant efficiency of the heat engine η, which includes the cases where the engine is maximizes ∣J12∣ for the initial resource, (C5) is solved as
| (C6) |
where ϑ0 = ϑ(0) is the initial value, and W[z] is Lambert’s function (or product logarithm), which solves equation z = WeW. Note that W[z] → z for z → 0, which determines the long-time behaviour ϑ(t) → (1 − η) of (C6), in this limit the power of of the heat engine nullifies since the efficiency of the heat engines becomes equal to the Carnot level η = ηC = 1 − ϑ(t).
The full dimensionless stored energy is found from (C4) and C6) as The full dimensionless stored energy is found from (C4) as
| (C7) |
| (C8) |
where (C7) follows from
| (C9) |
and (C8) holds due to the definition of W(z). When maximizing (C8) over η for a fixed ϑ0 (i.e. for a fixed initial resource), we find that (C8) is maximized for , which is larger than the value η = (1 − ϑ0)/2, at which the power ∣J12∣ is maximized. Hence, for sufficiently sizable resources it pays to consume them with the efficiency larger than at the maximum of ∣J12∣.
b. Adaptive agent
Next, let us consider the case of adaptive agent, where
| (C10) |
where 0 < α ≤ 1. This involves situations, where the engine is sufficiently complex to monitor the environment and adapt to time-dependent temperatures, i.e. (C10) involves the cases, where the engine maximizes ∣J12∣ for any time-dependent ϑ(t). Now (C5) is solved as
| (C11) |
Using the same method as for (C8), we get from (C10, C11):
| (C12) |
| (C13) |
which makes clear that the maximum of (C13) is reached for α → 1, i.e. once the adaptation (C10) is allowed, the maximal stored energy is reached under the most effective scenario of work-extraction. Note that reaching the α = 1 limit of (C13) demands an infinite time, since ∣J12∣ → 0 for α → 1.
It is worth noting, that the advanced capability of adaptation is not always provide an advantage over the simpler scenarios of adaptation when the heat engine is operating alone, i.e. without competition. Indeed, comparing (C13) and (C8) in the case of the possible maximum efficiency of the heat engine performing by fixed internal structure for the given initial resources ϑ0 it becomes obvious that the stored energy of the heat engine operating by local adaptation scenario is greater than that of performing by fixed internal structure scenario when
| (C14) |
Obviously, the right hand-side of (C14) is always ≤ 1. Thus, the stored energy of the heat engine operating by the local adaptation scenario is greater than that of operating by the fixed internal structure for α → 1. Note that in the case of rich resources ϑ ≈ 0 the stored energies of both heat engines operating on the maximum possible efficiencies will be the same.
Appendix D: Two agents (heat engines) on the same resource
a. Competition of two non-adaptive agents.
Let us now consider two agents (1 and 2) recalling that they are taken to have the same value for and ρ and do interact with the same thermal baths. Instead of (C4, C5) we shall now have (k = 1, 2)
| (D1) |
| (D2) |
According to (D2), ϑ will relax two times faster [than in (C5)] to the rest point . If we assume η1 < η2 than the second agent will need to have a finite life-time determined via η2 = 1 − ϑ(τ2). Otherwise for t > τ2 this agent will not anymore function as a heat engine. Hence the competition for the same resource brings in a finite life-time of functioning as heat engine.
By analogy to (C6) we solve (D2) as
| (D3) |
Hence the total extracted work by the second agent is to be calculated from (D1) as
| (D4) |
where τ2 is the time at which the interaction of the second agent with thermal baths is to be switched off, since for t > τ2 it will cease to function as a heat engine (i.e. extract work). Noting that W−1(z) = zez, we find from (D3,D4):
| (D5) |
| (D6) |
FIG. 7:
Implementation of best response algorithm for ϑ0 = 0.1. Each path starts from different initial values (η1, η2) illustrated by rectangles connected to the path by one-side. An rectangle on each path represents the steps taken by algorithm.
Indeed, we have from (D3):
| (D7) |
| (D8) |
| (D9) |
The work extracted by the first agent reads
| (D10) |
where in the first integral ϑ(t) is given by (D3). In the second integral θ(t) holds a different equation,
| (D11) |
since for times t > τ2 only the first agent couples to the baths.
The first integral in (D10) is calculated analogously to (D6):
| (D12) |
The second integral in (D10) is found from (C8), where we should change η → η1 and ϑ0 → 1 − η2.
Thus the total stored energy of the first agent has the following form
| (D13) |
b. Game theoretical analysis. Emergence of 2 × 2 game.
Let us define the stored energies of each agent (D13) and (D6) by W1(η1, η2) and W2(η1, η2). The stored energies of each agent obviously depends on both efficiency levels {η1, η2}. Hence, both agents face game theoretical situation where its obtained payoff depends also on the chosen strategy of its opponent. Here, strategies of each players are the chosen efficiently levels.
Thus, for each initial temperature ratio having the set of possible strategies {η1, η2–satisfying η1 < η2 ∈ [0, 1 − ϑ0], and payoffs [W1, W2] defined on the strategy profiles {η1, η2} one may try to find the pure Nash equilibrium of the game.
Here we employ the best response algorithm for search. The steps of the algorithm are as follows
initialize the input as arbitrarily point {η1, η2} such that η1 < η2.
Find the maximum of W1 and W2 keeping the opponents efficiency level fixed. update the values {η1, η2} from the maximization.
Repeat until algorithm converges.
A particular implementation of the algorithm is illustrated in Fig.7 for various initialization {η1, η2} and for ϑ0 = 0.1.
The algorithm yields a pair of efficiency levels and respectively for 1 and 2 agents. The values of the obtained efficiency levels for different initial temperature ratio is given in tableI.
TABLE I:
The values of obtained optimal efficiency levels , , , and the optimal efficiency level in the absence of competition for various initial temperature ratio ϑ0.
| ϑ 0 | |||||
|---|---|---|---|---|---|
| 0.1 | 0.687 | 0.416 | 0.427 | 0.447 | 0.427 |
| 0.25 | 0.5 | 0.279 | 0.293 | 0.315 | 0.293 |
| 0.5 | 0.292 | 0.151 | 0.161 | 0.179 | 0.161 |
| 0.75 | 0.13 | 0.06 | 0.07 | 0.08 | 0.07 |
| 0.9 | 0.05 | 0.024 | 0.026 | 0.03 | 0.026 |
As it is seen from the Table.I, the values of optimal efficiencies slightly differ.
These pairs form a cyclic best response , i.e there is no mutually preferable outcome for the agents.
Thus, the best response program yields to 2 × 2 game characterized by the presence of cyclic best responses (known as coexistence games). TableII illustrates an example of the resulted 2 × 2 game.
TABLE II:
2 × 2 game for ϑ0 = 0.1. The first/second number in each cell represents the payoffs of the first/second agent .
| (1.843, 1.726) | (1.770, 1.724) | |
| (1.846, 1.761) | (1.764, 1.764) |
Now, let us start from the cell . We see that for the first player it is better to change its strategy since . Then, the second agent will change its strategy since . In this way we will recover the whole best response cycle.
It has to be noted, that in the resulted 2 × 2 game each player receive less than if they would operate by , which is the optimal efficiency level in the absence of competition.
Indeed, the payoff of each player operating by is obtained from (C8) by interchanging μ → 2μ. For the considered case which is greater than any payoff in Table.II.
However, playing by optimal strategies against is preferable for each agent and , while
The latter situation holds for any initial temperature ratio ϑ0.
Thus, both agents face the well-known prisoners dilemma game.
c. Competition of two adaptive agents.
Let us consider now the competition of locally adaptive heat engines. We will assume that ηk = αk(1−ϑ(t)), k = 1, 2, i.e the efficiencies are varying according to the environmental changes. Putting the efficiencies back into the (D2) we obtain
| (D14) |
where . The only rest point of (D14) is ϑ = 1. Thus, in contrast to the previous case, here both engines operate till the full consumption of the available resources. The solution is found from (C11) by changing μ → 2μ and .
The stored energy of each agent (k = 1, 2) is found by analogy of (C13) and is given by the following expression
| (D15) |
From (D15) it follows that those engines for which α = 1/2 i.e will outperform others. Indeed, according to (A13) an engine operating by the α = 1/2 level is maximizing its power at each moment in time and eventually maximizes the stored energy due to the absence of any time constraints: since both engines are operating as long as ϑ(t) ≠ 1.
Let us discuss competition between two types of heat engines– operating close to the maximum efficiency αme → 1 and operating at maximum power , for the given initial temperature ratio ϑ0.
From (D15) follows that the stored energy of each of the agents operating by near maximum efficiency level αme → 1 will be proportional to . While, the stored energy of the agents operating at maximum power regime is proportional to , note that the prefactor is the same for both cases. Due to αme > αmp it is better to operate at higher efficiency level if opponent does so.
However, it is better to operate at the maximum power regime αmp when opponent is operating at αme, since αmp(1 − αmp) > αme(1 − αme) (the denominator in both expressio is same .
Thus, adaptive agents are facing the same dilemma as non-adaptive agents.
d. Adaptive v.s. non-adaptive agent.
Here we discuss competition of two agents– one with fixed internal structure (non-adaptive) and adaptive agents. In contrast to the above discussed competition scenario, here, only the agent with fixed internal structure has a finite lifetime. Indeed, the temperature ratio dynamics has the following form
| (D16) |
where we have used (C10) for the efficiency of the adaptive agent, and η is an efficiency of the agent with fixed internal structure. The second term of (D16) nullifies only on the point ϑ = 1 , i.e. when the resources are fully depleted. While, the first term nullifies at the time ϑ(τ) = 1 − η.
Solving (D16) by the analogy of the above mentioned procedure we found
| (D17) |
The lifetime τ of the agent with fixed internal structure is found from (D17) and the condition ϑ(τ) = 1 − η.
| (D18) |
The stored energy of non-adaptive agent during its’ lifetime will be equal to
| (D19) |
where the integral is found from (D17) and (D18)
| (D20) |
The stored energy of the adaptive agent is composed of two terms– the one in which another agent has been in the competition and after the turning of that agent. Note that the power of the adaptive agent has the following form . The stored energy of the adaptive agent during the competition time τ is equal
| (D21) |
| (D22) |
where we have denoted ζ ≡ 2 − α + (1 − α)η. After switching of the agent with fixed internal structure the time evolution of temperature ratio is given by (C11), where ϑ0 is substituted by 1 − η. The stored energy of the adaptive agent will be found from (C13) by substituting ϑ0 → 1 − η
| (D23) |
Footnotes
Even for those cases that seem completely isothermal—e.g. the mixing of different gases at the same temperature and pressure—there is clearly a degree of freedom (difference between gases) that is out of equilibrium.
References
- [1].Lotka Alfred J. Contribution to the energetics of evolution. Proceedings of the National academy of Sciences of the United States of America 8.6 (1922): 147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Svirezhev I, & Logofet DO (1978). The stability of biological communities. [Google Scholar]
- [3].Odum Howard T., and Pinkerton Richard C.. Time’s speed regulator: the optimum efficiency for maximum power output in physical and biological systems. American Scientist 43.2 (1955): 331–343. [Google Scholar]; Odum Howard T. Systems Ecology; an introduction. (1983). [Google Scholar]; Odum Howard T. Efficiencies, size of organisms, and community structure. Ecology 37.3 (1956): 592–597. [Google Scholar]
- [4].Jorgensen Sven Erik, and Svirezhev Yuri M.. Towards a thermodynamic theory for ecological systems. Elsevier, 2004. [Google Scholar]
- [5].Sella Guy, and Hirsh Aaron E.. The application of statistical physics to evolutionary biology. Proceedings of the National Academy of Sciences 102.27 (2005): 9541–9546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Drossel Barbara. Biological evolution and statistical physics. Advances in physics 50.2 (2001): 209–295. [Google Scholar]
- [7].De Vladar Harold P., and Nicholas H. Barton. The contribution of statistical physics to evolutionary biology. Trends in ecology & evolution 26.8 (2011): 424–432. [DOI] [PubMed] [Google Scholar]
- [8].Wolf Yuri I., Katsnelson Mikhail I., and Koonin Eugene V.. Physical foundations of biological complexity. Proceedings of the National Academy of Sciences 115.37 (2018): E8678–E8687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Schneider Eric D., and Kay James J.. Life as a manifestation of the second law of thermodynamics. Mathematical and computer modelling 19.6-8 (1994): 25–48. [Google Scholar]
- [10].Martyushev Leonid M., and Seleznev Vladimir D.. Maximum entropy production principle in physics, chemistry and biology. Physics reports 426.1 (2006): 1–45. [Google Scholar]
- [11].Ao Ping. Laws in Darwinian evolutionary theory. Physics of life Reviews 2.2 (2005): 117–156. [Google Scholar]
- [12].Xingbo Yang et al. , PNAS, Physical bioenergetics:Energy fluxes, budgets, and constraints in cells, 118, e2026786118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Balian R, From Microphysics to Macrophysics, Vol. I, (Springer Science & Business Media, 2007). [Google Scholar]
- [14].Grandy WT Jr. Foundations of Statistical Mechanics. Vol I: Equilibrium Theory (eidel, Dordrecht, 1987). [Google Scholar]; Foundations of Statistical Mechanics. Vol II: Non-equlibrium Phenomena (eidel, Dordrecht, 1988). [Google Scholar]
- [15].Muller I and Weiss W, Entropy and energy: a universal competition (Springer Science & Business Media, 2006). [Google Scholar]
- [16].Novikov II, J. Nuclear Energy II 7, 125 (1958). [Google Scholar]
- [17].Curzon F and Ahlborn B, Am. J. Phys 43, 22 (1975). [Google Scholar]
- [18].Van den Broeck C, Thermodynamic Efficiency at Maximum Power, Phys. Rev. Lett 95, 190602 (2005). [DOI] [PubMed] [Google Scholar]
- [19].Allahverdyan AE, Johal RS, and Mahler G, Work extremum principle: Structure and function of quantum heat engines , Phys. Rev. E 77, 041118 (2008). [DOI] [PubMed] [Google Scholar]
- [20].Allahverdyan AE, Hovhannisyan KV, Melkikh AV, and Gevorkian SG, Carnot Cycle at Finite Power: Attainability of Maximal Efficiency, Phys. Rev. Lett 111, 050601 (2013). [DOI] [PubMed] [Google Scholar]
- [21].Gorshkov VG, Physical and biological bases of life stability: man, biota, environment (Springer Science & Business Media, 2012). [Google Scholar]
- [22].Jusup M, Sousa T, Domingos T, Labinac V, Marn N, Wang Z, Klanjscek T. Physics of metabolic organization. Physics of life reviews. 2017. Mar 1;20:1–39. [DOI] [PubMed] [Google Scholar]
- [23].Sousa T, Mota R, Domingos T, Kooijman SM. Thermodynamics of organisms in the context of dynamic energy budget theory. Physical Review E. 2006. Nov 3;74(5):051901. [DOI] [PubMed] [Google Scholar]
- [24].Wagoner Jason A., and Dill Ken A.. Mechanisms for achieving high speed and efficiency in biomolecular machines. Proceedings of the National Academy of Sciences 116.13 (2019): 5902–5907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Wagoner Jason A., and Dill Ken A.. Opposing pressures of speed and efficiency guide the evolution of molecular machines. Molecular biology and evolution 36.12 (2019): 2813–2822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Brown Aidan I., and Sivak David A.. Allocating dissipation across a molecular machine cycle to maximize flux. Proceedings of the National Academy of Sciences 114.42 (2017): 11057–11062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Angulo-Brown Fernando, Santillán M, and Calleja-Quevedo E. Thermodynamic optimality in some biochemical reactions. Il Nuovo Cimento D 17.1 (1995): 87–90. [Google Scholar]
- [28].Roach Ty NF, et al. Application of finite-time and, control thermodynamics to biological processes at multiple scales. Journal of Non-Equilibrium Thermodynamics 43.3 (2018): 193–210. [Google Scholar]
- [29].Spitze Ken. Chaoborus predation and life-history evolution in Daphnia pulex: Temporal pattern of population diversity, fitness, and mean life history. Evolution 45.1 (1991): 82–92. [DOI] [PubMed] [Google Scholar]
- [30].Tessier Alan J., Leibold Mathew A., and Tsao Jean. “A fundamental trade-off in resource exploitation by Daphnia and consequences to plankton communities.” Ecology 81.3 (2000): 826–841. [Google Scholar]
- [31].Pfeiffer Thomas, Schuster Stefan, and Bonhoeffer Sebastian. Cooperation and competition in the evolution of ATP-producing pathways. Science 292.5516 (2001): 504–507. [DOI] [PubMed] [Google Scholar]
- [32].Aledo J. Carlos and del Valle Alicia Esteban. The ATP paradox is the expression of an economizing fuel mechanism. Journal of Biological Chemistry 279.53 (2004): 55372–55375. [DOI] [PubMed] [Google Scholar]
- [33].Westerhoff Hans V., Hellingwerf Klaas J., and Van Dam Karel. Thermodynamic efficiency of microbial growth is low but optimal for maximal growth rate. Proceedings of the National Academy of Sciences 80.1 (1983): 305–309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Aledo Juan Carlos, Pérez-Claros Juan A., and Del Valle Alicia Esteban. it Switching between cooperation and competition in the use of extracellular glucose. Journal of molecular evolution 65.3 (2007): 328–339. [DOI] [PubMed] [Google Scholar]
- [35].Ondrechen MJ, Andresen B, Mozurkewich M, Berry RS, Maximum work from a finite reservoir by sequential Carnot cycles American Journal of Physics, 49, 681–685 (1981). [Google Scholar]
- [36].Andresen B, Salamon P, Berry RS, Thermodynamics in finite time, Physics Today, September, 63 (1984). [Google Scholar]
- [37].Allahverdyan AE, Babajanyan SG, Martirosyan NH, and Melkikh AV, Adaptive Heat Engine, Phys. Rev. Lett 117, 030601 (2016). [DOI] [PubMed] [Google Scholar]
- [38].Gorban AN, Tyukina TA, Pokidysheva LI, and Smirnova EI, Dynamic and thermodynamic models of adaptation, Phys. Life Rev 37, 17–64 (2021). [DOI] [PubMed] [Google Scholar]
- [39].Kirkwood Thomas BL. Understanding the odd science of aging. Cell 120.4 (2005): 437–447. [DOI] [PubMed] [Google Scholar]
- [40].Maklakov Alexei A., and Immler Simone. ”The expensive germline and the evolution of ageing.” Current Biology 26.13 (2016): R577–R586. [DOI] [PubMed] [Google Scholar]
- [41].Kirkwood Thomas BL, and Austad Steven N.. Why do we age?. Nature 408.6809 (2000): 233–238. [DOI] [PubMed] [Google Scholar]
- [42].Chen Hwei-yen, et al. Trade-off between somatic and germline repair in a vertebrate supports the expensive germ line hypothesis. Proceedings of the National Academy of Sciences 117.16 (2020): 8973–8979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].van Kampen NG, Stochastic Processes in Physics and Chemistry (Elsevier, Amsterdam, 2007). [Google Scholar]
- [44].Callen H, Thermodynamics and an Introduction to Thermostatistics (Wiley, New York, 1985), 2nd ed. [Google Scholar]
- [45].Josh E Baker. Free energy transduction in a chemical motor model. Journal of Theoretical Biology 228 (2004): 467–476. [DOI] [PubMed] [Google Scholar]
- [46].Takahashi Hideaki, Umino Satoru, Miki Yuji, Ishizuka Ryosuke, Maeda Shu, Morita Akihiro, Suzuki Makoto, and Matubayasi Nobuyuki. Drastic compensation of electronic and solvation effects on ATP hydrolysis revealed through large-scale QM/MM simulations combined with a theory of solutions. The Journal of Physical Chemistry B 121 (2017): 2279–2287. [DOI] [PubMed] [Google Scholar]
- [47].McClare CWF In defence of the high energy phosphate bond. Journal of theoretical Biology 35 (1972): 233–246. [DOI] [PubMed] [Google Scholar]
- [48].Jaynes Edwin T. The Muscle as an Engine. Unpublished Manuscript. 1983. Available online: https://bayes.wustl.edu/etj/articles/muscle.pdf [Google Scholar]
- [49].Jarzynski Christopher. Thermalization of a Brownian particle via coupling to low-dimensional chaos. Physical Review Letters 74 (1995): 2937. [DOI] [PubMed] [Google Scholar]
- [50].Fang Xiaona, Kruse Karsten, Lu Ting, and Wang Jin. Nonequilibrium physics in biology. Reviews of Modern Physics 91 (2019): 045004. [Google Scholar]
- [51].Allahverdyan Armen E., and Wang QA. Adaptive machine and its thermodynamic costs. Physical Review E 87 (2013): 032139. [Google Scholar]
- [52].Caplan SR and Essig A, Bioenergetics and linear non-equilibrium thermodynamics (the steady state), (Har- ward University Press, Cambridge, 1983). [Google Scholar]
- [53].Babajanyan SG, Melkikh AV, and Al- lahverdyan Armen E.. Leadership scenarios in prisoner’s dilemma game. Physica A: Statistical Mechanics and its Applications 545 (2020): 123020. [Google Scholar]
- [54].Luce RD and Raiffa H, Games and decisions: Introduction and critical survey (Courier Corporation, 1989). [Google Scholar]
- [55].Myerson RB, Game Theory: Analysis of Conflict (Har-vard University Press, MA, 1997). [Google Scholar]
- [56].Nash John. Non-cooperative games. Annals of mathematics (1951): 286–295. [Google Scholar]
- [57].von Stackelberg H, The Theory of Market Economy (Oxford University Press, Oxford, 1952). [Google Scholar]
- [58].Babajanyan SG, Allahverdyan AE, and Cheong Kang Hao. Energy and entropy: path from game theory to statistical mechanics. Physical Review Research 2.4 (2020): 043055. [Google Scholar]
- [59].Roth AE, Axiomatic Models of Bargaining (Springer Verlag, Berlin, 1979). [Google Scholar]
- [60].Cressman Ross. Continuously stable strategies, neighborhood superiority and two-player games with continuous strategy space. International Journal of game theory 38, no. 2 (2009): 221–247. [Google Scholar]
- [61].Eshel Ilan. Evolutionary and continuous stability. Journal of theoretical Biology 103, no. 1 (1983): 99–111. [DOI] [PubMed] [Google Scholar]
- [62].Pangallo M, Heinrich T, & Doyne Farmer J (2019). Best reply structure and equilibrium convergence in generic games. Science advances, 5(2), eaat1328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Gioria Margherita, and Osborne Bruce A.. Resource competition in plant invasions: emerging patterns and research needs. Frontiers in Plant Science 5 (2014): 501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Keddy Paul A., and Shipley Bill. Competitive hierarchies in herbaceous plant communities. Oikos (1989): 234–241. [Google Scholar]
- [65].Weiner Jacob. Asymmetric competition in plant populations. Trends in ecology & evolution 5.11 (1990): 360–364. [DOI] [PubMed] [Google Scholar]
- [66].Smith Melinda D., and Knapp Alan K.. Physiological and morphological traits of exotic, invasive exotic, and native plant species in tallgrass prairie. International Journal of Plant Sciences 162.4 (2001): 785–792. [Google Scholar]
- [67].Morris Lorna L., Walck Jeffrey L., and Hidayati Siti N. Growth and reproduction of the invasive Ligustrum sinense and native Forestiera ligustrina (Oleaceae): implications for the invasion and persistence of a nonnative shrub. International Journal of Plant Sciences 163.6 (2002): 1001–1010. [Google Scholar]
- [68].Iponga Donald M., Milton Suzanne J., and Richardson David M.. Superiority in competition for light: a crucial attribute defining the impact of the invasive alien tree Schinus molle (Anacardiaceae) in South African savanna. Journal of Arid Environments 72.5 (2008): 612–623. [Google Scholar]
- [69].Alpert Peter, Bone Elizabeth, and Holzapfel Claus. Invasiveness, invasibility and the role of environmental stress in the spread of non-native plants. Perspectives in plant ecology, evolution and systematics 3.1 (2000): 52–66. [Google Scholar]
- [70].Funk Jennifer L. ”The physiology of invasive plants in low-resource environments.” Conservation physiology 1.1 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [71].Funk Jennifer L., and Vitousek Peter M.. “Resource-use efficiency and plant invasion in low-resource systems.” Nature 446.7139 (2007): 1079–1081. [DOI] [PubMed] [Google Scholar]
- [72].Falster Daniel S., and Westoby Mark. Plant height and evolutionary games. Trends in Ecology & Evolution 18.7 (2003): 337–343. [Google Scholar]
- [73].Anten Niels PR. Optimal Photosynthetic Characteristics of Individual Plants in Vegetation Stands and Implications for Species Coexistence. Annals of Botany, 95 (2005): 495–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [74].Gruntman Michal, et al. Decision-making in plants under competition. Nature communications 8.1 (2017): 1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [75].Novoplansky Ariel. Picking battles wisely: plant behaviour under competition. Plant, cell & environment 32.6 (2009): 726–741. [DOI] [PubMed] [Google Scholar]
- [76].Melkonian Erica A., and Schury Mark P.. Biochemistry, anaerobic glycolysis. (2019). [PubMed] [Google Scholar]
- [77].Hammond Kimberly A., and Diamond Jared. Maximal sustained energy budgets in humans and animals. Nature 386.6624 (1997): 457–462. [DOI] [PubMed] [Google Scholar]
- [78].Toloza ERICM, Lam MANDY, and Diamond JARED. Nutrient extraction by cold-exposed mice: a test of digestive safety margins. American Journal of Physiology-Gastrointestinal and Liver Physiology 261.4 (1991): G608–G620. [DOI] [PubMed] [Google Scholar]
- [79].Konarzewski Marek, and Diamond Jared. Peak sustained metabolic rate and its individual variation in cold-stressed mice. Physiological Zoology 67.5 (1994): 1186–1212. [Google Scholar]
- [80].Hammond Kimberly A., et al. Metabolic ceilings under a combination of peak energy demands. Physiological Zoology 67.6 (1994): 1479–1506. [Google Scholar]
- [81].Hammond Kimberly, and Diamond Jared. Limits to dietary nutrient intake and intestinal nutrient uptake in lactating mice. Physiological Zoology 67.1 (1994): 282–303. [Google Scholar]
- [82].Hofbauer Josef, and Sigmund Karl. Evolutionary games and population dynamics. Cambridge university press, 1998. [Google Scholar]
- [83].Schuster Stefan, et al. Use of game-theoretical methods in biochemistry and biophysics. Journal of biological physics 34.1 (2008): 1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [84].Veiga Alexandra, Arrabaça João D., and Loureiro-Dias Maria C.. Cyanide-resistant respiration is frequent, but confined to yeasts incapable of aerobic fermentation. FEMS microbiology letters 190.1 (2000): 93–97. [DOI] [PubMed] [Google Scholar]
- [85].Voet Donald, Voet Judith G., and Pratt Charlotte W.. Fundamentals of biochemistry: life at the molecular level. John Wiley & Sons, 2016. [Google Scholar]
- [86].MacLean R. Craig, and Gudelj Ivana. Resource competition and social conflict in experimental populations of yeast. Nature 441.7092 (2006): 498–501. [DOI] [PubMed] [Google Scholar]
- [87].Kedem O, and Roy Caplan S. Degree of coupling and its relation to efficiency of energy conversion. Transactions of the Faraday Society 61 (1965): 1897–1911. [Google Scholar]
- [88].Stucki, Jörg W. The optimal efficiency and the economic degrees of coupling of oxidative phosphorylation. European Journal of Biochemistry 109.1 (1980): 269–283. [DOI] [PubMed] [Google Scholar]
- [89].Torres JL Natural selection and thermodynamic optimality. Il Nuovo Cimento D 13.2 (1991): 177–185. [Google Scholar]
- [90].Heiden Vander, Matthew G, Lewis C. Cantley, and Craig B. Thompson. Understanding the Warburg effect: the metabolic requirements of cell proliferation. science 324.5930 (2009): 1029–1033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [91].Liberti Maria V., and Locasale Jason W.. The Warburg effect: how does it benefit cancer cells?. Trends in biochemical sciences 41.3 (2016): 211–218. [DOI] [PubMed] [Google Scholar]
- [92].Zheng JIE Energy metabolism of cancer: Glycolysis versus oxidative phosphorylation. Oncology letters 4.6 (2012): 1151–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [93].Hanahan Douglas, and Weinberg Robert A.. Hallmarks of cancer: the next generation. cell 144.5 (2011): 646–674. [DOI] [PubMed] [Google Scholar]
- [94].Forsman Anders. Rethinking phenotypic plasticity and its consequences for individuals, populations and species. Heredity 115.4 (2015): 276–284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [95].Meyers Lauren Ancel, and Bull James J.. Fighting change with change: adaptive variation in an uncertain world. Trends in Ecology & Evolution 17.12 (2002): 551–557. [Google Scholar]
- [96].Stearns Stephen C. The evolutionary significance of phenotypic plasticity. Bioscience 39.7 (1989): 436–445. [Google Scholar]
- [97].Piersma Theunis, and Drent Jan. Phenotypic flexibility and the evolution of organismal design. Trends in Ecology & Evolution 18.5 (2003): 228–233. [Google Scholar]
- [98].Laland Kevin N., et al. The extended evolutionary synthesis: its structure, assumptions and predictions. Proceedings of the Royal Society B: Biological Sciences 282.1813 (2015): 20151019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [99].Odling-Smee John. Niche construction. University of California Press, 2012. [Google Scholar]
- [100].Constant Axel, et al. A variational approach to niche construction. Journal of the Royal Society Interface 15.141 (2018): 20170685. [DOI] [PMC free article] [PubMed] [Google Scholar]








