Skip to main content
Springer logoLink to Springer
. 2024 Mar 5;191(3):35. doi: 10.1007/s10955-024-03236-5

Multicyclic Norias: A First-Transition Approach to Extreme Values of the Currents

Matteo Polettini 1, Izaak Neri 2,
PMCID: PMC10914643  PMID: 38455591

Abstract

For continuous-time Markov chains we prove that, depending on the notion of effective affinity F, the probability of an edge current to ever become negative is either 1 if F<0 else exp-F. The result generalizes a “noria” formula to multicyclic networks. We give operational insights on the effective affinity and compare several estimators, arguing that stopping problems may be more accurate in assessing the nonequilibrium nature of a system according to a local observer. Finally we elaborate on the similarity with the Boltzmann formula. The results are based on a constructive first-transition approach.

Keywords: Extreme value statistics, Effective affinity, Statistical mechanics of irreversible phenomena

Introduction

Let us first provide an illustrative simple example of some of our main results. Perform a random walk on the network

graphic file with name 10955_2024_3236_Equ90_HTML.gif

where every time you cross one specific edge in one direction you get 1€, and when you cross it in the opposite direction you pay 1€. All other transitions also give and take credit, but in liras. Initially your pockets are empty of euros, and while you have a virtually infinite reservoir of liras your bank would not convert them into euros.

The question we pose and answer in this paper is: assuming that you live forever, what is the probability f- that you eventually go broke (reach -1€, that you cannot actually afford to pay)?

In the special case where you are initially in (so that at least you get a chance to not get immediately broke!), we find, on the assumption F>0,

f-=exp-F, 1

with

graphic file with name 10955_2024_3236_Equ2_HTML.gif 2

where each diagram implies multiplication by the corresponding rates of the Markov chain. If instead F<0, the probability to go bankrupt is 1.

In the case where the diagonal transition has vanishing rates in both directions, in the last expression the diagrams containing diagonal terms disappear and we are left with the log-ratio of two cyclic contributions

graphic file with name 10955_2024_3236_Equ3_HTML.gif 3

This object is known as cycle affinity, a measure of the probability of performing the cycle in one direction relative to the opposite. For unicyclic networks (“norias”) Eq. (1) boils down to a remarkable formula first obtained by Bauer and Cornu [1] (which in fact is slightly more general in that the initial state can be chosen arbitrarily). As reasonable, the cycle is completed more often in the favourable direction (A>0) rather than the unfavourable one; then the Bauer-Cornu formula yields the probability of the rare event of the cycle to ever be completed more often in the unfavourable direction. Generalizations have been proven for the total entropy production (a weighted balance of euros and liras) using the tools of martingale theory [24],1 and discussed in the light of first-exit time problems [5].

Beyond the probabilistic interpretation, cycle affinities also afford two different thermodynamic interpretations, one global and one local, of which we give here an intuitive sketch (but see Sec. 3.2 for details), on the assumption that forward and backward rates of a transition xy have ratio expδqxy/Txy with δq the heat (in our example: currency) exchange along that transition and T the temperature (in our example this could be the interest rate) of that transition, i.e. a measure of how inconvenient it is to perform that transition (to borrow money). Then the global one is as Carnot’s entropy production along a cyclic process [6]

A=δqT 4

where is a shorthand for the sum over cyclic transitions. This takes into account all contributions, e.g. from liras and from euros, even if for the problem at hand liras to not play any role. Thus this latter interpretation has little operational value. The local one is

graphic file with name 10955_2024_3236_Equ5_HTML.gif 5

in terms the specific transaction in euros, its temperature Inline graphic, and the value of such temperature T at which forward and backward euro transactions happen with the same frequency (that is: the rest of the world does not care much of whatever you do with your euros).

Going back to our main result Eq. (1) (which, we remind, holds for arbitrary non-unicyclic networks), here the effective affinity does not have such a simple global interpretation, but still retains the local interpretation. What is lost with respect to norias is the independence from the initial state: while it does not matter where you are initially along a cycle to complete that cycle, it does matter where you are in a network to perform an arbitrary composition of cycles that touch the initial state. We generalize Eq. (1) appropriately.

We then show by computational examples that first-passage and extreme value problems such as the one above might give a better estimate of the effective affinity than do fluctuation and fluctuation-dissipation relations at fixed stopping time. Finally, Eq. (1) is reminiscent of Boltzmann’s formula, but turned upside down. We linger on this analogy towards the conclusions.

Framework

State-Space Processes

Consider an irreducible continuous-time Markov chain on finite state space X. We characterize it in terms of the probability ptX={ptX(x),xX} of being at x at time t, which satisfies the continuous-time master equation

ddtptX(x)=x(x)Xr(x|x)ptX(x)-r(x|x)ptX(x) 6

starting from a given initial distribution p0X, with non-negative rates r(x|x) of jumping from x to x. We also define the continuous-time (adjoint) generator R with matrix entries

Rx,x=r(x|x)-δx,xr(x) 7

where δ is Kroenecker’s delta and

r(x)=xXr(x|x) 8

is the exit rate out of a state. The master equation reads in vector form ddtptX=RptX and its stationary distribution solves RpX=0. From now on we do not specify the range of summation unless necessary.

We now focus on a pair of connected states, namely x,x=1,2 without loss of generality. We assume that edge 12 is not a bridge, that is, that its removal does not disconnect the system, and denote X (or simply ) a system where edge 12 is removed. Transitions between these states are deemed to be visible to an external observer. Let L={=21,=12} denote transitions between such states, to and from, and t(·),s(·){1,2} the source and target states of a transition, i.e. t()=s()=1 and t()=s()=2.

Letting n() be the number of times transition occurs along a realization of the process, we define the visible activity and the cumulated current respectively as

n=n()+n(), 9
c=n()-n(). 10

Notice that they typically grow linearly in time; thus we denote the mean stationary current (i.e. cumulated current per unit time) as

c˙=r(1|2)pX(2)-r(2|1)pX(1). 11

Our final goal is to compute the probability f± that the cumulated current c takes value ±1 at least once as the process unfolds from time t=0 to time t+.

Transition-State Processes

Our strategy is to lift the description of the process from state space X to transition space L, following the treatment of Ref.  [7]. The central objects to be calculated are the trans-transition probabilities p(|) that the next observable transition is given that the previous was . An intuitive way to go about this would by brute-force coarse-graining of a stochastic trajectory x,τ=x0,τ0x1,τ1xk,τk in state space, where xj are the visited states and τj are the permanence times. Then p(|) can be computed by marginalization of the probability density function at fixed number of jumps k

fk(x,τ)=i=1k-1r(xi+1|xi)e-r(xi)τi 12

by integrating away the intermediate times and summing over all trajectories between the target state of and the source state of but that otherwise do not contain observable transitions, and multiplying by the rate of this latter transition. This direct procedure is illustrated in the Supplemental Material of Ref.  [7].

A more elegant line is the following. Notice that with probability one the visible activity takes any positive integer value and at any given time t does not depend on future information. Thus the time when the activity reaches a certain value n for the first time is a valid stopping time. Then by the strong Markov property [8] the event of being at x after n visible transitions is also a Markov process in state space. Let pnL() be the probability that the n-th visible transition is . Notice that the probability that the next transition is given that the previous was only depends on the target state of . Thus we conclude that pnL satisfies a discrete-time Markov chain in transition space

pn+1L()=Lp(|)pnL() 13

evolving from some initial probability p1L() that the first transition is . The p(|) are the the so-called trans-transition probabilities; we arrange them in a trans-transition matrix P with entries P,=p(|).

Both the initial transition probability and the trans-transition probabilities can be obtained from the initial state probability p0X and the transition rates r(x|x) by solving first-transition time problems. In particular the probability that, starting from x, is the first visible transition and that it occurs in the time interval [t,t+dt) is given by

r(t()|s())exptSs(),xdt 14

where S is the matrix obtained from R by setting to zero the off-diagonal entries corresponding to the visible transition, namely

Sx,x=Rx,x,for(x,x)(1,2),(2,1),S1,2=S2,1=0. 15

By integrating Eq. (14) from t=0 to infinity and evaluating at x=t() we find, for all ,L, the trans-transition probabilities

p(|)=-r(t()|s())[S-1]s()t(), 16

and, for all L, the probability of the first transition

p1L()=-r(t()|s())x[S-1]s(),xp0X(x). 17

The invertibility of matrix S is granted by the fact that, as a corollary of the Perron-Froebenius theorem [9], its Perrron root is strictly smaller than that of R, which is zero (see [7, Appendix] for more details). There it was also proven that trans-transition probabilities and the initial transition probability are positive and normalized, as they should be:

1=p1L()+p1L()=p(|)+p(|),forL. 18

Explicitly, the trans-transition matrix is given by

P=1ν+ν-ν0ν-νννν-ν 19

where, letting A\(x1,,xn|x1,,xn) be a matrix from which rows x1,,xn and columns x1,,xn are removed, we have

ν=r(1|2)r(2|1)detR\(1,2|2,1)ν=r(1|2)detR\(2|1)ν=r(2|1)detR\(1|2). 20

A proof of these expressions is given in Appendix A.1.

Results

Statement and Derivation of the Main Result

We can now formulate our problem of calculating the probability that the cumulated current ever hits value -1 (case +1 for later) as

f-=n=1f-(n) 21

where f-(n) is the probability that the cumulated current c takes value -1 for the first time at the n-th visible transition. The first is just the probability that the transition occurs right-away:

f-(1)=p1L(). 22

Notice instead that the cumulated current cannot be -1 after two visible transitions:

f-(2)=0. 23

For the cumulated current to be -1 for the first time at the third visible transition, we need that the first visible transition is and the second and third are , therefore:

f-(3)=p(|)p(|)p1L(). 24

To go beyond, first notice that all probabilities at even n vanish. At odd n, we need to count all different paths of 2n+1 steps that perform a transition leading from c=0 to c=-1 for the first time as the last step, and multiply each path by the corresponding probability. Namely, we need to count all different sequences (1,2,,2n+1) such that:

  1. 1= and 2n+1=;

  2. at any intermediate step the number of is never greater than the number of and at 2n the number of is exactly equal to the number of ;

  3. they have a given amount kn of | trans-transitions (which also fixes the number of |, |, and | trans-transitions).

In fact, this question maps to a well-known enumeration problem: if we replace with a 45 unit segment and with a -45 unit segment, we need to count all “mountains” of length 2n that can be drawn without lifting the pencil and that have exactly k peaks (see Fig. 1). This problem is well-known to be solved by the Narayana numbers [10]

N(n,k)=1nnknk-1. 25

Therefore for n1 the result to our problem is

f-(2n+1)=p1L()p(|)p(|)k=1nN(n,k)[p(|)p(|)]n-k[p(|)p(|)]k. 26

Notice that the prefactor in Eq. (26) accounts for the initial transition (which must be ), for the last transition (which must be given that the previous was also ), and for the fact that in such mountains valleys are one less than peaks.

Fig. 1.

Fig. 1

A mountain with n=5 up and down slopes, 3 peaks and 2 valleys

We now use the fact that Narayana numbers admit the generating function [11]

G(x,y)=n1k=1nN(n,k)xnyk=1+x(1-y)-1-2x(1+y)+x2(1-y)22x-1. 27

Then, letting

x=p(|)p(|) 28
y=p(|)p(|)x, 29

we find that

f-=p1L()+p1L()p(|)p(|)G(x,y). 30

Now notice that, using normalization of the trans-transition probabilities Eq. (18), we have

x(1-y)=p(|)+p(|)-1,x(1+y)=2p(|)p(|)-p(|)-p(|)+1. 31

After some tedious but revealing calculation (see appendix A.2 for details) one obtains that the square root in Eq. (27) has real-valued solution

1-2x(1+y)+x2(1-y)2=|p(|)-p(|)| 32

in terms of the absolute value |·|. We then find the remarkably simple expression

G(x,y)=p(|)/p(|),ifp(|)<p(|)p(|)/p(|),ifp(|)p(|). 33

Plugging this latter into Eq. (30) we find our central result

f-=min1,p1L()+p1L()p(|)p(|)p(|)p(|), 34

where the two values are obtained respectively for p(|)<p(|) and for p(|)p(|). To express f- as a minimum between two values we used the fact that, because p(|)/p(|)=[1-p(|)]/[1-p(|)], the second value is monotonically increasing in p(|) and decreasing in p(|), and is only 1 for p(|)=p(|).

Now notice that the stationary distribution in transition space (eigenvector of the trans-transition matrix relative to eigenvalue 1, PpL=pL) is easily found to be pL()p(|¯), where ¯ denotes the reverse transition of (i.e. ¯=, ¯=). Therefore we can rewrite the above expression as

f-=min1,p1L()+p1L()p(|)pL()p(|)pL(). 35

Now consider the probability f+ that the cumulated current ever reaches value +1. A quick review of the above derivation promptly leads to

f+=minp1L()+p1L()p(|)pL()p(|)pL(),1, 36

where the two values are taken respectively for p(|)<p(|) and for p(|)p(|).

The Effective Affinity

Let us define

F=logp(|)p(|). 37

This quantity has been given an operational thermodynamic interpretation in Refs.  [12, 13] as follows. By Eqs. (19) and (20) we have

F=logr(1|2)[detR\(2|1)-r(2|1)detR\(1,2|2,1)]r(2|1)[detR\(1|2)-r(1|2)detR\(1,2|2,1)]=logr(1|2)p(2)r(2|1)p(1) 38

where in the second expression p=limtpt is the stationary probability of the system where transition 12 is removed, i.e. Rp=0 (see Appendix A.2 for a direct proof; the distribution is unique by the assumption that edge 12 is not a bridge). Notice that this is a stalling system, that is, one where (by non-existence of the transition!) the mean stationary current c˙=r(1|2)p(2)-r(2|1)p(1) vanishes.

Let us now parametrize rates according to the principle of local detailed balance [14, 15]

r(x|x)r(x|x)=expδqxxTxx 39

in terms of a energy increment δqxx=-δqxx and a temperature profile Txx=Txx describing the influence of a local bath’s degrees of freedom. We assume that temperature T12 is specific of transition 12 (that is, its variation does not affect other rates). Then it was proven [12] that there exists a value T12 for which the mean current stalls (but here the transition is possible!). Nevertheless, a simple argument shows that the stationary values p(2) and p(1) are the same as in the system where the transition is removed altogether (see Appendix A.3). We therefore have

0=c˙=r(1|2)p(2)-r(2|1)p(1) 40

leading to p(2)/p(1)=-δq12/T12 and

F=δq121T12-1T12. 41

This latter local expression grants an operational procedure to measure F, on the assumption that δq12 is measured or theoretically determined by a microphysical theory of the system describing energy levels, that T12 is tunable, and that the mean current c˙ is observable. The procedure consists in tuning T12 to the value T12 for which the observable mean current vanishes. Then, if δq12 is known, F is determined in terms of the inverse temperature difference.

As regards the global acceptation of affinity mentioned in the introduction, for systems containing a single oriented cycle C it is easily shown [16, 17] that F=A is the cycle affinity, namely the ratio of the products of rates along the cycle, in opposite directions

A=log(xx)Cr(x|x)r(x|x)=(xx)CδqxxTxx=δqT. 42

For vanishing A (Kolmogorov condition) one finds an equilibrium state with vanishing mean current. From the above relation one immediately finds for the equilibrium temperature the relation

δq12T21=-(12)(xx)CδqxxTxx. 43

For generic multicyclic systems, this latter identification with a specific thermodynamic cycle is not possible. However, the cumulated current c=Cc(C) can in fact be envisioned as the sum of the winding numbers over all cycles that include the visible transition (see Refs.  [18, 19] for some insights on such winding numbers). Notice that a stalling mean current does not imply global equilibrium, as these cycles may have circulation even if overall the visible mean current stalls. An explicit expression of F in terms of such cycles is

F=logCw(C)(xx)Cr(x|x)Cw(C)(xx)Cr(x|x) 44

where w(C) is some cycle weight, independent of the cycle’s orientation [12]. Nevertheless, defining entropy production as the Kullback–Leibler distance of random processes from their time-reversed, it has been shown that Fc˙ is indeed the entropy production estimated by an external observer who only has access to the sequence of visible transitions [16, 17].

Special Cases and the Noria, and a Generalization

We consider two special cases where our main results write in terms of the effective affinity. Here we resolve the explicit dependency of the stopping probability in terms of the probability p1L of the first transition, f±=f±p1L. Remember that such probability can eventually be computed from the initial probability in state space p0X via Eq. (17). Finally we generalize the above results to the probability of hitting arbitrary low values.

Stationary Case

In the first case we sample the initial transition from the stationary distribution. We easily find from Eqs. (35) and (36)

f-pL=min1,pL()(1+exp-F), 45
f+pL=minpL()(1+exp+F),1. 46

From an operational point of view this is particularly simple because it only requires to wait long enough for the system to stationarize. Then pL can be computed explicitly from the time series of the transitions, by just counting the relative frequency of ’s and ’s.

Cyclic Case

In the second case, we prepare the system just after a visible transition is performed and then wait for the same transition to occur again, thus completing a cycle. Therefore for c=+1 we prepare the system at the tipping point of , which gives p0X(x)=δx,1 so that, after Eq. (17) is applied, p1L()=p(|). For c=-1 we prepare the system at the tipping point of , which gives p0X(x)=δx,2 and p1L()=p(|). After some calculation trick such as

p(|)1+p(|)p(|)=p(|)1+1-p(|)p(|)=expF 47

we find

f-p(·|)=min1,exp-F, 48
f+p(·|)=minexp+F,1. 49

This result is analogous to the one derived in Ref.  [1] for unicyclic systems, with the exception that in the unicyclic case the choice of initial state (or, equivalently, the final transition) is not relevant, given that all states share the same cycle and therefore the explicit dependency on the initial state drops and the above result simplifies to

f±[·]=min1,exp±A 50

where f±[·] is just the probability that the cycle is ever completed in either direction, independently of the initial state.

Hitting -n

The above hitting result for the cumulated current to ever become -1 (for F>0) lends itself to a simple generalization to the case of the cumulated current hitting value -n, for nN. Intuitively (given that denumerable + denumerable = denumerable) this is just given by reiterating the hitting problem (renewal property), with the initial condition stabilizing to the previous occurrence of just after the first occurrence. One immediately obtains

f-n[p1L]=f-1[p1L]f-1[p(·|)]n-1=p1L()eF+p1L()ee-nF, 51

where we rewrote p(|)/p(|)=expF as the effective affinity of a system whose trans-transition matrix P has the columns swapped with respect to P; interestingly this auxiliary dynamics also plays a role in formulating the transient fluctuation relation in Ref.  [7], but its physical interpretation has still to be clarified.

Fluctuation Relations

In the unicyclic case, one easily finds the fluctuation relation

f+[·]f-[·]=expA. 52

In the multicylic case, from Eqs. (49), (48) we have

f+p(·|)f-p(·|)=expF. 53

This looks formally like a fluctuation relation, with a caveat: in fluctuation relations the probabilities being compared should be the same, while in this case they are different probabilities, as they are conditioned on two different initial distributions, viz. p(·|) and p(·|). This, as we will see, has consequences on the computational or experimental interpretation of data, given that one should prepare different experiments for forward and backward processes and post-select their outcome, which is not desirable. In the next section we comment further on this aspect, arguing that Eq. (53) may in fact be the best chance of an estimator of nonequilibrium despite approximations.

Furthermore, in Ref.  [7] it was proven (Eq. (21)) that, by sampling the initial transition from distribution p1L()p(|), the following fluctuation relation holds

pn(c)pn(-c)=expcF, 54

where we remind that c is given by Eq. (10) and pn(c) is the probability that the cumulated current is a certain value cZ after n visible transitions. One can then further derive the relation

nNpn(+1)nNpn(-1)=expF 55

where N is any subset of N. This is reminiscent of Eq. (53), but notice that these latter are not independent probabilities.

Finally, fluctuation relations for single edge currents at stopping times different than the total number of visible transitions (in particular at “clock time” t) do not generally hold – but in the unicyclic case – because the statistics of a specific current depends on all other currents flowing through the network. This is what makes relations such as Eqs. (53) and (55) particularly appealing, as they are local and phenomenological, and do not depend on knowledge of the whole system.

Estimation of the Effective Affinity

Many of the above expressions can be used to build estimators of the effective affinity. We will focus on the ones coming from cyclic processes.

Consider M independent realizations of a trajectory performing N visible transitions:

1(m),2(m),,N(m),m[1,M]. 56

Define the cumulated current after the n-th visible transition

c^n(m)=k=1nδk(m),-δk(m),. 57

It has empirical distribution

p^n(c)=m=1Mδc^n(m),c,forc[-n,n] 58

and empirical mean and variance

c^n=1Mm=1Mcn(m)=c[-n,n]cp^n(c), 59
c^n2=1Mm=1M(c^n(m)-c^n)2=c[-n,n]c2p^n(c)-c^n2. 60

Define the empirical stopping times

N^±(m)=inf{n[0,N]s.t.c^n(m)=±1}{N+1} 61

and the estimators of the stopping probabilities

f^±=min1-1Mm=1MδN^±(m),N+1,1M 62

where the minimum is introduced to avoid possible divergences in the case f^±=0 (see also Eq. (25) in Ref.  [20]).

Notice that due to the finite cutoff on the number of transitions, given Eq. (21) these latter are biased. In particular they systematically underestimate (on average) the true stopping probability due to the fact that all occurrences of c=±1 after N visible transitions are discarded.

Assuming that we can ignore the initial conditions, we can invert Eqs. (48) and (49) to obtain an estimator of the effective affinity

F^cy=logf^+,iff^+f^-,-logf^-,iff^+>f^-. 63

We can compare this to the estimator coming from the stopping fluctuation relation

F^fr=logf^+-logf^-, 64

which is generally biased due to the different initial conditions in Eq. (53).

We complement these stopping-problem estimators with an estimator coming from the theory of linear response out of stalling states [21]

F^lr=2c^Nc^N2 65

and with an estimator obtained from the standard entropy production expression as a Kullback–Leibler divergence (properly regularized to avoid taking log0)

F^kl=1c^Nc[-N,N]p^N(c)p^N(-c)0p^N(c)logpN^(c)p^N(-c). 66

This latter is well-known to be a biased estimator, and better practices in evaluating relative entropies correct these biases but also greatly increase the running time (see Supplementary Material in Ref.  [16]). We do not concern ourselves with this issue here.

In Fig. 2 we compare the behaviour of these estimators in a simple model. The linear regime estimator F^lr performs better near the stalling condition F=0, while it diverges significantly out of stalling. On the contrary, the cyclic estimator F^cy converges far from stalling, but it systematically suffers from the finite N cutoff. The entropy production estimator F^kl is also biased and noisy due to the tails of the cumulated current’s distribution. The stopping fluctuation-relation estimator F^fr instead appears to not be affected by all these issues, despite the approximation due the bias in the different initial state in Eq. (53).

Fig. 2.

Fig. 2

For a fully-connected four-state model with all unit rates except for R1,2=expF and initial state x=1 (that is p0X(x)=δ1,x): (dashed) the effective affinity F; (continuous) estimator F^fr of logf+/f-; (crossed) estimator F^cy of sign(f+-f-)logfσ; (bullets) the linear regime estimator F^lr; (triangles) the entropy production estimator F^kl. The ultimate stopping time was set to N=20 and the number of samples to M=10,000

The reason behind the left-right asymmetry in the above plot is due to the fact that for simplicity we decided to only perturb one rate R1,2=expF and keep all others fixed. This choice is useful to show that entropy production estimators can lead to noise in the tails depending on time-scale separation between rates. Had we distributed the perturbation among R1,2 and R2,1, we would have obtained a more symmetric plot.

Discussion

A Nonequilibrium Boltzmann Formula?

Our Eq. (1) can be seen as a “nonequilibrium Boltzmann formula” given its similarities with S=logW connecting entropy S and probability 1/W (W being the volume of state space), elaborated by Boltzmann and refined by Planck (Boltzmann’s constant set to unity). But with some precautions.

Einstein wrote about the Boltzmann formula: «To be able to calculate W, one needs a complete theory of the system under consideration. If considered from a phenomenological point of view [this] equation appears devoid of content». Einstein then inverted the equation to make it a rule for inferring probabilities from measured entropy differences between equilibrium states – which better capture the dynamical nature of processes – and used it to perfection Smoluchowski’s theory of critical opalescence [22]. However, at least since Kant, philosophers warn us that observations are not independent of conceptions, and therefore deduction from measurements needs theory (the fluctuations of what?), and theory needs the human touch. Still now we don’t know which came first, whether the chickens of gases and thermal machines or the eggs of thermodynamics and statistical mechanics [23]. At equilibrium the situation is aggravated by the fact that the construction of thermodynamic potentials requires many arbitrary choices by the observer [24], while the pursue of objectiveness requires a description of processes in terms of invariant quantities.

Far from equilibrium, flows of heat to and from the environment are not quantified by differences of a state function, but by “inexact differences”. By the so-called principle of local detailed balance ratios of probabilities of forward-to-backward processes have been connected to so-called affinities that quantify the entropy production along cyclic processes, and which are invariant upon the redefinition of the fundamental degrees of freedom [24]. However, until recently it has proven difficult to directly connect probabilities and meaningful physical quantities. In fact, despite some claims, there are no predictive variational principles far from equilibrium [25, 26].

However, a concern still plagues our result. What comes first: the egg of F, or the chicken of f-? Only circumstances can tell.

Relation to a Companion Publication

The present manuscript is strictly related to a companion work [27] by the same Authors that addresses similar questions. Let us clarify in which ways.

Equation (51) is strictly related to Eq. (14) in Ref.  [27]. There the normalized probability p-n of the cumulated current taking minimum value -n is addressed, while in our case f-n allows that, after hitting value -n, the cumulate current may take even more negative values. Therefore we have, intuitively, that this latter is the cumulative distribution of the former

f-n=knp-k. 67

Given that p-k is normalized, this identification allows to estimate the escape probability that the cumulated current never actually attains a negative value as p0=1-f-1, which in view of Eq. (34), the explicit expression for the trans-transition probabilities Eqs. (19) and (20), and the explicit expression for the probability of the first transition Eq. (17) allows to express p0 in terms of the (distribution of) the initial state (see below the explicit expression).

The other main difference between the two works is methodological. Here we follow a constructive but specific approach based on first-transition time techniques and combinatorics, while Ref.  [27] is rooted in the more general theory of martingales. In particular in Ref.  [27] it is shown that, upon a proper choice of initial state, exp-Fc is a martingale, and in particular its expected value exp-Fc is constant in time. Doob’s optional stopping theorem then states that this time can be any proper stopping time. By choosing the moment when the cumulated current hits the boundary values n+>0 or n-<0 for the first time, and given that c starts from value 0, one obtains

1=exp-Fc=f+(n-)e-Fn++f-(n+)e-Fn- 68

where f-(n+) is the probability of hitting n- whilst not hitting n+. The nonequilibrium Boltzmann formula follows by taking n-=-1 and n+, in which limit f-1(n+)f-1. Interestingly, similar formulas were derived in an optimization context in Ref.  [28].

Finally, here is a short dictionary of equivalent terms and concepts in the two papers: transition rates r(x|x) here are kuv there; the observed edge 12 is yx; “cumulated” currents c are “integrated” currents J; for stationary probabilities we have instead of “ss”; the effective affinity F is a; the extremum probability p-n[p1L], given Eq. (17), is pJxyinf(-|X(0)=x0); the escape probability p0 is

pesc(x0)=1+kxy[S-1]x,y[S-1]y,x0[S-1]y,x+kyx[S-1]y,y[S-1]x,x0[S-1]x,xe-a 69

where S is the matrix with entries Su,v=kvu-δu,vwX;wukuw if (u,v)(x,y),(y,x), else Sx,y=Sy,x=0, and we used the explicit expression of the effective affinity Eq. (37), that now translates into

a=logkxy[S-1]x,ykyx[S-1]y,x. 70

When x0=x we find

pesc(x)=1+kxy[S-1]x,y+kyx[S-1]y,ye-a=1-kxydetS\(y|x)-kyxdetS\(y|y)detSe-a=1-e-a, 71

where this latter passage follows from the algebraic manipulations in Appendix A.1. We thus recover Eq. (16) in the companion paper. We checked computationally the more general equivalence (implied by the theory) of Eqs. (69) with (80) in the companion paper, but a direct proof has remained elusive.

Conclusions

Both martingale and first-transition methods are having a revival in connection to thermodynamic considerations [24, 7, 16, 17, 29], and they may lead to independent generalizations and applications of our results. In both approaches, the main open question is the generalization to an arbitrary subset of currents – neither the full entropy production nor a single edge current.

As regards the first-transition approach followed here, as soon as one steps out of the single-edge case the Markov property of the process in transition space is lost. Here the combinatorial approach may allow some exploration.

Since any Radon-Nicodym derivative of two probability distributions over realizations of the process is a martingale, martingales can be used to generalise the results in this paper. From this approach it would seem that one can generate an arbitrary number of first-hitting results by building ad hoc auxiliary dynamics. However, the physical interpretation of this class of results may not be clear: it is crucial in our approach that the effective affinity has a clear operational interpretation. In particular, if one could tune its value by just “turning a knob”, then the effective affinity is just the difference of that knob’s value (in proper physical units) where one wants to perform the experiment and the value of that knob at which the observable current vanishes on average. This local operational interpretation dispenses one to compute the effective affinity from knowledge of all the inner details of the fundamental thermodynamic cycles that influence that particular current.

On a more speculative side, notice that in our derivation we made an arbitrary restriction of the solution of the Narayana generating function, based on the assumption that we expect probabilities to be real-valued. It may be interesting to explore the meaning of the complex-valued solution.

Notoriously, Boltzmann’s epitaph is his formula. But it took a whole community (including Einstein, Planck etc.) to digest it. So who’s formula is it?

Acknowledgements

MP is grateful to Paulo Fernando Lévano for a philosophical consultation.

Appendices

A.1: Trans-transition probabilities in terms of minors

We prove Eqs. (19), given (16) and (20). We use the well-known matrix inverse

[S-1]x,x=(-1)x+xdetS\(x|x)detS, 72

where we remind that S (as per Eq. (15)) is a matrix obtained from the generator R (Eq. (7)) by setting to zero the off-diagonal entries (1, 2) and (2, 1).

First consider detS\(1|1) and detS\(2|2). Since the removal of the first line and column, and of the second line and column, both take away the entries (1, 2) and (2, 1) which are the only ones that differ among S and R, these determinants are identical to detR\(1|1) and detR\(2|2). Therefore from Eq. (16) we find

p(|)=-R1,2[S-1]2,2=-R1,2detR\(2|2)detS=R1,2detR\(2|1)detS=νdetS 73

where in the second passage we used the well-known fact that for any stochastic rate matrix R the cofactors (-1)x+xdetR\(x|x) are independent of x (this follows for example from Eq. (85) in Appendix A.3), and in the third we used the definition in Eqs. (19) and (20). A similar formula is found for p(|).

As regards detS\(1|2) (respectively, detS\(2|1)), in this case the matrix resulting from the removal of the first row and second column only differs from R\(1|2) by entry R2,1. Using the Laplace cofactor expansion for determinants we thus obtain

detS\(1|2)=detR\(1|2)-r(1|2)detR\(1,2|2,1) 74

and given the definitions in Eq. (20) similar formulas as Eq. (73) follow for p(|) and p(|).

Finally, consider any matrix A, and decompose its determinant in terms that are respectively linear in A1,2, linear in A2,1 or that contain A1,2A2,1

detA=aA1,2+bA2,1+cA1,2A2,1+d 75

where abcd are four parameters that do not depend on A1,2 and A2,1 and are unrelated to previous notation. By Jacobi’s formula the dependency on A1,2 is

detAA1,2=-detA\(1|2). 76

Repeating with respect to A2,1 we obtain

c=2detAA1,2A2,1=detA\(1,2|1,2). 77

Coefficient a is found from Eq. (76) by subtracting away this latter term multiplied by A2,1, and similarly for b:

a=-detA\(1|2)-A2,1detA\(1,2|1,2) 78
b=-detA\(2|1)-A1,2detA\(1,2|1,2). 79

Taking A=R, we have detA=0 and d=detS. Therefore we obtain

detS=d-detA=-aA1,2-bA2,1-cA1,2A2,1=r(1|2)detR\(1|2)+r(2|1)detR\(2|1)+r(1|2)r(2|1)detR\(1,2|1,2) 80

where in the first passage we used Eq. (75) and in the second we used the explicit expressions for the parameters. In view of Eq. (20) this yields the trans-transition probabilities in Eq. (19).

A.2: From the Narayana generating function to trans-transition probabilities

First let us show Eq. (32). Using Eq. (31) we have

1-2x(1+y)+x2(1-y)2=1-2[2p(|)p(|)-p(|)-p(|)+1]+[p(|)+p(|)-1]2=1-4p(|)p(|)+2p(|)+2p(|)-2+p(|)2+p(|)2+1+2p(|)p(|)-2p(|)-2p(|)=[p(|)-p(|)]2. 81

Then from Eq. (27), for p(|)p(|)

G(x,y)=p(|)+p(|)-p(|)+p(|)2p(|)p(|)-1 82
=1p(|)-1 83
=p(|)p(|), 84

which is the lower entry of Eq. (33). Similarly for the other case.

A.3 Effective affinity and stalling distribution

First notice that the stationary distribution pX(x) of a continuous-time Markov generator R can be found as follows. Given that detR=0, expanding with Laplace’s cofactor formula, we find

0=detR=x(-1)x+xRxxdetR\(x|x). 85

But then

pX(x)=Z-1(-1)x+xdetR\(x|x) 86

for any choice of x, where Z is the normalization.

Now let R be the generator of the continuous-time Markov process where the visible rates are set to zero, r(1|2)=r(2|1)=0. We obtain, choosing (x,x)=(1,2) and (x,x)=(2,1)

p(1)=-Z-1detR\(2|1)p(2)=-Z-1detR\(2|1) 87

where the second follows from Eq. (74). But now notice that removing the first row and second column [or vice versa] from R results in the same matrix as by removing the first row and second column [or viceversa] from S. Therefore in view of Eq. (74), and because of the cancellation of the terms -Z-1, we find Eq. (38).

Finally, consider a system with local rates tuned to a stalling temperature T according to the principle of local detailed balance

r(1|2)r(2|1)=expδq12T 88

such that the mean current vanishes. Let RT be its generator. Notice that it differs from R. However, its stationary distribution is the same. In fact, computing Rx,xp(x) we find explicitly

x1,2,xr(x|x)p(x)-r(x|x)p(x)=xRx,xp(x)x2r(1|x)p(x)-r(x|1)p(1)+r(1|2)p(2)-r(2|1)p(1)=0=xRx,xp(x)x1r(2|x)p(x)-r(x|2)p(2)+r(2|1)p(1)-r(1|2)p(2)=0=xR2,xp(x) 89

where we used the fact that the mean current vanishes by assumption, and on the right-hand side we recognized the stationary equation Rp=0. .

Data Availability

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

Conflict of interest

The research was supported by the National Research Fund Luxembourg (project CORE ThermoComp C17/MS/11696700) and by the European Research Council, project NanoThermo (ERC-2015-CoG Agreement No. 681456).

Footnotes

1

More precisely, from Eq. (10) in Ref. [2], the stochastic entropy production of a generic system satisfies f-=1/e (e Neper’s number), which is the above formula for A=1; equivalently take s+ and set s-=1 in Eq. (5) in Ref.  [3].

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Bauer M, Cornu F. Affinity and fluctuations in a mesoscopic noria. J. Stat. Phys. 2014;155(4):703–736. [Google Scholar]
  • 2.Neri I, Roldán É, Jülicher F. Statistics of infima and stopping times of entropy production and applications to active molecular processes. Phys. Rev. X. 2017;7(1):011019. [Google Scholar]
  • 3.Neri I, Roldán É, Pigolotti S, Jülicher F. Integral fluctuation relations for entropy production at stopping times. J. Stat. Mech: Theory Exp. 2019;2019(10):104006. [Google Scholar]
  • 4.Neri I. Universal tradeoff relation between speed, uncertainty, and dissipation in nonequilibrium stationary states. SciPost Physics. 2022;12(4):139. [Google Scholar]
  • 5.Ptaszyński K. First-passage times in renewal and nonrenewal systems. Phys. Rev. E. 2018;97(1):012127. doi: 10.1103/PhysRevE.97.012127. [DOI] [PubMed] [Google Scholar]
  • 6.Polettini M. Nonequilibrium thermodynamics as a gauge theory. Europhys. Lett. (EPL) 2012;97(3):30003. [Google Scholar]
  • 7.Harunari PE, Garilli A, Polettini M. Beat of a current. Phys. Rev. E. 2023;107(4):042105. doi: 10.1103/PhysRevE.107.L042105. [DOI] [PubMed] [Google Scholar]
  • 8.Ibe O. Markov Processes for Stochastic Modeling. Newnes: Elsevier; 2013. [Google Scholar]
  • 9.Suzumura K. Perron–Frobenius theorem on non-negative square matrices: an elementary proof. Hitotsubashi Journal of Economics. 1983;24:137–141. [Google Scholar]
  • 10.Petersen, T.K.: Eulerian numbers. In: Eulerian Numbers, pp. 3–18. Springer, New York (2015)
  • 11.Stanley, R.P.: Enumerative Combinatorics, vol. 1, 2nd edn. Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge (2011)
  • 12.Polettini M, Esposito M. Effective fluctuation and response theory. J. Stat. Phys. 2019;176(1):94–168. [Google Scholar]
  • 13.Bisker G, Polettini M, Gingrich TR, Horowitz JM. Hierarchical bounds on entropy production inferred from partial information. J. Stat. Mech: Theory Exp. 2017;2017(9):093210. [Google Scholar]
  • 14.Maes, C.: Local Detailed Balance. SciPost Physics Lecture Notes, vol. 032. SciPost, Amsterdam (2021)
  • 15.Esposito M, Van den Broeck C. Three faces of the second law. I. Master equation formulation. Phys. Rev. E. 2010;82(1):011143. doi: 10.1103/PhysRevE.82.011143. [DOI] [PubMed] [Google Scholar]
  • 16.Harunari PE, Dutta A, Polettini M, Roldán É. What to learn from a few visible transitions’ statistics? Phys. Rev. X. 2022;12(4):041026. [Google Scholar]
  • 17.Van der Meer J, Ertel B, Seifert U. Thermodynamic inference in partially accessible Markov networks: a unifying perspective from transition-based waiting time distributions. Phys. Rev. X. 2022;12(3):031025. [Google Scholar]
  • 18.Polettini M, Falasco G, Esposito M. Tight uncertainty relations for cycle currents. Phys. Rev. E. 2022;106(6):064121. doi: 10.1103/PhysRevE.106.064121. [DOI] [PubMed] [Google Scholar]
  • 19.Jiang Y, Wu B, Jia C. Large deviations and fluctuation theorems for cycle currents defined in the loop-erased and spanning tree manners: a comparative study. Physical Review Research. 2023;5(1):013207. [Google Scholar]
  • 20.Neri I. Estimating entropy production rates with first-passage processes. J. Phys. A: Math. Theor. 2022;55(30):304005. [Google Scholar]
  • 21.Altaner B, Polettini M, Esposito M. Fluctuation-dissipation relations far from equilibrium. Phys. Rev. Lett. 2016;117(18):180601. doi: 10.1103/PhysRevLett.117.180601. [DOI] [PubMed] [Google Scholar]
  • 22.Jona-Lasinio G. Large deviations and the Boltzmann entropy formula. Brazilian Journal of Probability and Statistics. 2015;29(2):494–501. [Google Scholar]
  • 23.Penocchio, E.: Thermodynamics of chemical engines: A chemical reaction network approach. PhD thesis, Physics (2022)
  • 24.Polettini, M.: Of dice and men. Subjective priors, gauge invariance, and nonequilibrium thermodynamics. arXiv preprint (2013). arXiv:1307.2057
  • 25.Maes C, Netočnỳ K. Minimum entropy production principle from a dynamical fluctuation law. J. Math. Phys. 2007;48(5):053306. [Google Scholar]
  • 26.Polettini M. Fact-checking ziegler’s maximum entropy production principle beyond the linear regime and towards steady states. Entropy. 2013;15(7):2570–2584. [Google Scholar]
  • 27.Neri, I., Polettini, M.: Extreme value statistics of edge currents in Markov jump processes. SciPost Physics 14, 131 (2023). 10.21468/SciPostPhys.14.5.131
  • 28.Cavina V, Mari A, Giovannetti V. Optimal processes for probabilistic work extraction beyond the second law. Sci. Rep. 2016;6(1):1–13. doi: 10.1038/srep29282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sekimoto, K.: Derivation of the first passage time distribution for Markovian process on discrete network. arXiv Preprint (2021). arXiv:2110.02216

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.


Articles from Journal of Statistical Physics are provided here courtesy of Springer

RESOURCES