Active Inference and Functional Parametrisation: Differential Flatness and Smooth Random Realisation

Hugues Mounier; Thomas Parr; Karl Friston

doi:10.3390/e28010087

. 2026 Jan 11;28(1):87. doi: 10.3390/e28010087

Active Inference and Functional Parametrisation: Differential Flatness and Smooth Random Realisation

Hugues Mounier ^1,^*, Thomas Parr ², Karl Friston ^3,^4,^*

Editor: José FF Mendes

PMCID: PMC12839773 PMID: 41593994

Abstract

This paper is a first attempt to marry constructive nonlinear control theory techniques with active inference. Specifically, we are interested in the relationship between differential flatness and the design of generative models for use in control settings. We place specific emphasis on the pathwise properties of differentially flat systems that inherit from their definition in terms of successive temporal derivatives and relate this to the use of generalised coordinates of motion in formulating continuous-time generative models in active inference. To illustrate the basic concepts, we appeal to the example of oculomotor control.

Keywords: differential flatness, active inference, periodic smooth random functions, pathwise formulations

1. Introduction

Active inference is one, if not the most, promising formal framework for computational neuroscience, with many applications in a number of areas (see, e.g., [1] and the references therein). The recent developments foregrounding pathwise formulations and Bayesian mechanics—developed in [2,3] among others—furnish a principled and natural setting to address many aspects of perception, planning, and control. The central idea that we will appeal to in this paper is that (natural or artificial) creatures make use of implicit generative world models both to draw inferences about the world and to control their sensed world. It is the latter that is more salient for our current purposes. In fact, one could regard the inferential (i.e., perceptual) role of generative models as merely a method of attaining tight bounds on marginal likelihoods—the quantity of interest when optimising sensory data through action.

We here consider some conditions whose fulfilment may help to construct generative models well suited for controlling one’s environment. Crucially, conditions are in direct agreement with constructive frameworks developed in nonlinear control (see, e.g., [4,5,6]). We thus try to show how this framework, and in particular the differential flatness structural property, can be leveraged in applications of active inference to control problems.

Many insightful works have already been published in this general area, both on the links and differences between active inference and classical control schemes and on deploying active inference for control (see, e.g., [7,8,9,10,11,12,13,14] just to name a few).

At first sight, differential flatness and active inference seem quite distant frameworks. The first aims to reduce a trajectory tracking error to zero, while the second minimises surprise or variational free energy; the first is inherently deterministic, and the second naturally deals with stochastic fluctuations. We shall see that the trajectory tracking error is indeed a form of surprise and that the very definition of differential flatness can be adapted to smooth random fluctuations, particularly apt for neuroscientific applications. There are two main approaches to minimise surprise or discrepancy:

○
The first is to envision the problem as an optimisation procedure, as in active inference (but also in optimal control or model predictive control); in this respect, the goal is to fulfil an optimisation criterion, leading to the minimisation of target discrepancies.
○
The second is to derive a deterministic control (action) scheme for the goal and estimate the fluctuations online to actively compensate for them.

The first route is that taken by active inference, while the second underwrites the differential flatness approach. We shall nevertheless see that the two frameworks are not as different as one may think. Indeed, the mode dynamics enforced by free energy minimisation—when those dynamics enjoy the differential flatness property—yield a functional parametrisation linking the two schemes intimately. Moreover, this parametrisation appears to be a most fruitful and promising tool to study nonlinear dynamics in neuroscience and biology.

More broadly, here, we try to foreground some potentially useful features of our framework, which draws from dynamical control system structural properties in a typical pathwise and physically preserving formulation. In particular, we shall see how the most striking feature of differential flatness—namely, differential parametrisation—can be of use within the active inference framework. More specifically, this differential parametrisation induces, among other relations, an invertible mapping from the sensation to action.

The problem we will be interested in in Section 5 is one in which we have a generative model that describes how (we believe) some system will evolve over time as a function of some control variables. Given a desired state or path for this system, our interest is in understanding how the control variables would have to be set in order to realise that desired configuration. More specifically, we are interested in the properties a generative model should possess in order to infer the control variables that realise some goal.

This paper is structured as follows. We first provide a brief introduction to the idea of a generative model for control and introduce some of the key definitions we will appeal to subsequently. Several of these definitions turn out to rely upon differentiable fluctuations, which leads us to a consideration of the interpretation of stochastic fluctuations that we will commit to (i.e., the use of random periodic functions). Having outlined the basic structure of the sorts of generative models we are interested in, we consider the way in which the principles that underwrite active inference constrain choices we might make when designing these models and equipping them with goals. We use the forms of the key objective functions from active inference (variational and expected free energy) to consider some of the generic properties of good generative models and consider whether these properties can be motivated by appealing to the notion of differential flatness. Following these theoretical considerations, we demonstrate the way these ideas work in practice through a worked example based upon oculomotor control. We conclude with a discussion of the relationship between some of these ideas and active inference, with a particular focus on the notion of generalised coordinates of motion, which inherit from similar ideas to that of differential flatness but turn out to play a very different role.

2. Preliminary Notions: Generative Model, Action, State, and Fluctuation Choice

2.1. Generative Models

As outlined above, we are interested in control problems that are formulated in terms of generative models. As an example to make this clear, which will be unpacked in much greater detail later on, consider how we might go about controlling the positions of our eyes such that we achieve a specific fixation point on a surface or track a moving point. To decide how to move our eyes, our brains might employ a model in which action variables ( $u$ ), like the contraction of extra-ocular muscles, might influence the dynamics of states ( $x$ ), such as the angle of our gaze or fixation point, and in which these states return some observable (e.g., visual) data ( $y$ ). We now precisely outline the notion of a generative model.

Definition 1 (Generative model).

A generative model is a set of stochastic differential equations of the following form:

$\begin{matrix} \dot{x} & = f (x, u, θ) + ζ_{x} \end{matrix}$ (1a)

$\begin{matrix} y & = h (x, θ) + ζ_{y} \end{matrix}$ (1b)

where $x (t) \in R^{n}$ is a vector representing the state of the system (e.g., oculomotor apparatus) to be controlled (see Definition for a precise definition), $u (t) \in R^{m}$ is the action vector (an external control input), the signal or instruction we might send to that system, θ is a vector of the agent’s model parameters, $y (t) \in R^{p}$ is the output vector, the agent’s sampling of the sensor signal, and $ζ_{x} (t) \in R^{n}$ and $ζ_{y} (t) \in R^{p}$ are the state and output fluctuations, considered, in the Stratonovich sense, as smooth random functions (see Section 2.3 below). The functions f and h are here supposed to be meromorphic in their arguments, and we suppose that $x, y$ belong to a functional space $F$ such that $F ⫋ C^{0} (R)$ , with $C^{0} (R)$ being the space of continuous functions in time.

It is often convenient, in applications of active inference, to write a generative model in terms of generalised coordinates of motion. These coordinates are the coefficients of a Taylor series expansion of variables around the current time. In doing so, we replace a single nonlinear stochastic differential equation (for x) $\dot{x} = f (x, u, θ) + ζ_{x}$ with a set of locally linearised equations for each order of motion: ${\dot{x}}^{(n)} = f^{(n)} (x^{(n)}, u, θ) + ζ_{x}^{(n)}$ , where the superscript in brackets indicates the order of motion (i.e., the term in the Taylor series to which the coefficient belongs or the order of the temporal derivative one must take to get to these variables from our initial variable).

The motivation for this formulation is twofold. First, it allows one to express noise processes of varying smoothness in terms of the covariance between the fluctuations assigned to each order of motion. Second, through applications of the chain rule, it allows one to determine the gradients of generalised coordinates of y with respect to u via the gradients of the generalised coordinates of x. This will be important later, when we look to the way in which descent of variational free energy gradients—by changing u—can be understood as analogous to the enaction of spinal reflex arcs that bring proprioceptive data in line with anticipated setpoints.

The need to find mappings from sensation to action is one of the key ideas common to active inference—using generalised coordinates of motion—and differential flatness as applied to control. It is interesting that both approach this problem by appealing to successive derivatives of the variables in a generative model.

Definition 2 (Mean generative model).

Given a generative model as above, the mean, or deterministic, generative model is given by the following set of differential equations:

$\begin{matrix} \dot{x} & = f (x, u, θ) \end{matrix}$ (2a)

$\begin{matrix} y & = h (x, θ) \end{matrix}$ (2b)

It thus corresponds to the deterministic part of the corresponding stochastic differential equation system.

2.2. Action, Output, and State

We here give precise definitions for the notions of action, or control input, output, state, and realisation, which will be useful in Section 6.2. These were given in a differential algebraic setting in [4]; see also [15] for an elementary treatment, analogous to the one below (although stated in a deterministic setting). Let us note that the subsequent definitions, especially the differential flatness definitions, are made in the presence of fluctuations. This is not to say that we lose the deterministic character of this notion, since these perturbations may well be deterministic functions. They may also be stochastic, if they are sufficiently smooth, i.e., differentiable up a certain order.

Note that the action $u (t)$ , or the control input, are functions enabling us to act on the system in order to fulfil a specified goal. The dynamics equations thus form an undetermined system of differential equations, since the control functions $u (t)$ are not a priori determined. Once the control variables are fixed (i.e., substituted with known functions of time), the system (1) becomes determined (i.e., can be solved or integrated).

An output $y (t)$ is generated by the model. These outputs or observations may represent signals coming from the senses, in case the agent is a living being, or from the sensors in the case of an artificial system. More precisely, we have the following definition.

Definition 3 (action (control input) and output).

Consider a model with variables $z (t) = (z_{1} (t), \dots, z_{r} (t))$ , fluctuations $ζ (t) = (ζ_{1} (t), \dots, ζ_{s} (t))$ , and equations

$\begin{matrix} Φ (z, \dot{z}, \dots, z^{(ρ_{z})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$ (3)

A control input, or action, of the model is an m-tuple (with $m < r$ ) of variables $u (t) = (u_{1} (t), \dots,$ $u_{m} (t))$ with the following properties.

Endogenous character.

The components of $u$ can be (locally and generically) expressed as a function of $z$ and its derivatives:
$\begin{matrix} u_{i} & = ψ_{u i} (z, \dot{z}, \dots, z^{(r_{u i})}) \end{matrix}$ (4)

Differential independence.

There does not exist any nontrivial differential relation of the form
$\begin{matrix} R_{u} (u, \dot{u}, \dots, u^{(β_{u})}) & = 0 \end{matrix}$ (5)

Any system variable is influenced by the input.

Every $z_{i}$ , $i = 1, \dots, r$ satisfies
$\begin{matrix} E_{i} (z_{i}, {\dot{z}}_{i}, \dots, z_{i}^{(δ_{z i})}, u, \dot{u}, \dots, u^{(δ_{u i})}, ζ, \dot{ζ}, \dots, ζ^{(δ_{ζ i})}) & = 0 \end{matrix}$ (6)

An output $y (t) = (y_{1} (t), \dots, y_{p} (t))$ (with $p < r$ ) is a p-tuple, where the $y_{i}$ s are functions of the model’s variables:

$\begin{matrix} y_{i} & = h_{y_{i}} (z, \dot{z}, \dots, z^{(ν_{z})}, ζ, \dot{ζ}, \dots, ζ^{(ν_{ζ})}) \end{matrix}$ (7)

Remark 1 (Local genericity).

Generally, in (4), we have to deal with implicit functions:

$\begin{matrix} Ψ (u, z, \dot{z}, \dots, z^{(r_{u})}) & = 0 \end{matrix}$ (8)

And, locally, in the neighbourhood of generic regular points (i.e., points where the Jacobian of Ψ with respect to $u$ is regular), we can solve (8) for $u$ using the implicit function theorem. This will be termed locally and generically here and in the forthcoming definitions.

The state variables represent the instantaneous memory of the system: once the control (action) variables have been determined, the knowledge of the state variables (at time t) enables the prediction of the future state (at time $t + d t$ ). A complementary formulation is the following: the state of a dynamical system is a set of physical quantities, the specification of which (in the absence of external excitation) completely determines the evolution of the system. More precisely, we have the following definition.

Definition 4 (state).

Consider a model with variables $z (t) = (z_{1} (t), \dots, z_{r} (t))$ , input $u (t) = (u_{1} (t), \dots,$ $u_{m} (t))$ , fluctuations $ζ (t) = (ζ_{1} (t), \dots, ζ_{s} (t))$ , and equations

$\begin{matrix} Φ (z, \dot{z}, \dots, z^{(ρ_{z})}, u, \dot{u}, \dots, u^{(ρ_{u})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$ (9)

A state of the model is an n-tuple (with $n ⩽ r$ ) of variables $x (t) = (x_{1} (t), \dots, x_{n} (t))$ with the following properties.

Endogenous character.

The components of $x$ can be (locally and generically) expressed as a function of $z$ and its derivatives:
$\begin{matrix} x_{i} & = ψ_{x i} (z, \dot{z}, \dots, z^{(r_{x i})}) \end{matrix}$ (10)

Independence with respect to the input.

There does not exist any nontrivial differential relation of the form
$\begin{matrix} R_{x} (x, u, \dot{u}, \dots, u^{(β_{x})}) & = 0 \end{matrix}$ (11)

Representation property.

Every variable $\overset{ˇ}{x}$ that is a function of $z$ and of its time derivatives can be expressed through $x$ , $u$ and its derivatives. In other words, there exists (locally and generically) a representation of the form
$\begin{matrix} \overset{ˇ}{x} & = ϕ_{\overset{ˇ}{x}} (x, u, \dot{u}, \dots, u^{(β_{\overset{ˇ}{x}})}, ζ, \dot{ζ}, \dots, ζ^{(β_{ζ i})}) \end{matrix}$ (12)

By virtue of the above representation property, for all $i = 1, \dots, n$ , the time derivatives ${\dot{x}}_{i}$ are expressed through $x$ , $u$ and its time derivatives. This can be otherwise stated as follows.

Definition 5 (State representation).

Consider a model with variables $z (t) = (z_{1} (t), \dots, z_{r} (t))$ , input $u (t) = (u_{1} (t), \dots,$ $u_{m} (t))$ , fluctuations $ζ (t)$ $= (ζ_{1} (t),$ $\dots,$ $ζ_{s} (t))$ , equations

$\begin{matrix} Φ (z, \dot{z}, \dots, z^{(ρ_{z})}, u, \dot{u}, \dots, u^{(ρ_{u})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$ (13)

and state $x$ . Then, there exists a so-called state representation of the form

$\begin{matrix} {\dot{x}}_{i} & = ϕ_{x_{i}} (x, u, \dot{u}, \dots, u^{(β_{x_{i}})}, ζ, \dot{ζ}, \dots, ζ^{(β_{ζ i})}) \end{matrix}$ (14)

When the right-hand side of (14) does not depend on the derivatives of the input $u$ (i.e., when $β_{x_{i}} = 0$ for all $i = 1, \dots, n$ ), the state representation is called classical.

A realisation of a model consists of a state and a state representation for this model, as the following definition states.

Definition 6 (Realisation).

Consider a model with variables $z (t) = (z_{1} (t), \dots, z_{r} (t))$ , fluctuations $ζ (t) = (ζ_{1} (t), \dots, ζ_{s} (t))$ , and equations

$\begin{matrix} Φ (z, \dot{z}, \dots, z^{(ρ_{z})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$ (15)

where $z$ comprises an input $u$ and output $y$ : ${u_{1}, \dots, u_{m}, y_{1}, \dots, y_{p}} \subset {z_{1}, \dots, z_{r}}$ . A realisation of this model consists in the existence of a state $x$ and a state representation of the form (14) for the model.

2.3. Fluctuation Choice

An interesting point about the definitions above is that they depend upon there being differentiable (i.e., smooth or analytic) fluctuations. This means we need to think carefully about what we mean by fluctuations—an important topic generally in the study of stochastic dynamical systems. Several choices can be made for the fluctuations $ζ_{*}$ , $* \in {x, y}$ appearing in the above generative models. These include the following:

○
Stochastic processes in the Itô sense.
○
Stochastic processes in the Stratonovich sense.
○
Nonstandard infinitesimals (see, e.g., [16,17,18]);
○
Stochastic processes with Hölder continuous sample paths, yielding radom ODEs (RODEs) (see, e.g., [19]);
○
Rough paths (see, e.g., [20]);
○
Random Fourier series (RFS) (see, e.g., [21,22] for metric and convergence properties, with extensions on Riemannian manifolds [23] and locally compact groups [24]; see also [25] for an engineer’s view).

Let us choose the latter because they may furnish a convenient $C^{\infty}$ form of fluctuations, and the solution of random ODEs can be shown to converge to solutions of a Stratonovich stochastic differential equation (see, e.g., [25], Theorem 5.1). We may thus consider so-called periodic smooth random functions:

\begin{matrix} ζ (t) & = a_{0} + \sum_{j = 1}^{r} [a_{j} cos (\frac{2 π j}{L} t) + b_{j} sin (\frac{2 π j}{L} t)], r = ⌊ L / λ ⌋ \end{matrix}

(16)

where $λ, L > 0$ , each $a_{j}$ and $b_{j}$ is an independent sample from $N (0, 1 / (2 r + 1))$ (where $N (μ, V)$ denotes the real normal distribution of mean $μ$ and variance V), and $⌊ \cdot ⌋$ is the floor function. This type of function is L-periodic, entire, and $(2 π / λ)$ -band-limited.

Then, multiplying (16) by $\sqrt{2 / λ}$ and taking the variance to be $1 / ((2 r + 1) λ)$ instead of $1 / (2 r + 1)$ , we obtain the notion of a big periodic smooth random function. Note that since $r \approx L / λ$ , we have $2 / ((2 r + 1) λ) \approx 1 / L$ . Thus, in the big normalisation, the random coefficients of the sum $ζ$ have variances essentially independent of $λ$ as $λ \to 0$ . Then, as $λ \to 0$ , indefinite integrals of big smooth random functions converge with probability 1 to standard Brownian paths (see, e.g., Theorem 4.3 of [25] or Theorem 2, p. 236 of [21]). Regarding smooth random functions, the reader may also consult the very interesting works of R.J. Adler and J.E. Taylor, [26,27,28], with insightful chapters on geometry and smooth random manifolds, among others. Interestingly, this technology underwrites generative models in classical brain imaging analysis, namely, statistical parametric mapping, based upon random field theory and topological inference [29,30]. The previous convergence can also be related to the Wong–Zakai theorems (see, e.g., [31]) used in the insightful paper [32], Subsection 1.3.2.

Smooth random functions may not be an apt choice at atomic scales, where a particle’s movements are highly erratic. However, they become particularly appropriate at the cell and mesoscopic scales and most probably appropriate at macroscopic scales at which many fluctuations are generated by dynamical systems that evolve over a timescale faster than that considered for a given control problem (see [1] and the seminal observations of Stratonovich: “a certain care must be taken in replacing an actual process by Markov process, since Markov processes have many special features, and, in particular, differ from the processes encountered in radio engineering by their lack of smoothness… Any random process actually encountered in radio engineering is analytic, and all its derivatives are finite with probability one” ([33], pp. 122–124)).

Remark 2 (Wavelet random series).

One may be tempted, in the spirit of the above, to consider wavelet random series, since wavelet expansions are better behaved than their Fourier counterparts. However, recent works by C. Esser, S. Jaffard, and B. Vede [34] (see also [35]) suggest that caution is in order; in contrast to Fourier series, the randomisation of almost every continuous function gives an almost surely nowhere locally bounded function.

We sill refer to the notion of a tube around a nominal function, i.e., for $f (t) + ζ (t)$ , where f is a deterministic function and $ζ$ a periodic smooth random function. We shall be concerned with estimating the probability of leaving a tube of width $2 λ$ (for $λ > 0$ ). For instance, one has the following estimate (see [36]):

\begin{matrix} P (sup_{t \in [0, T]} ζ (t) ⩾ λ) ⩽ (\frac{σ}{\sqrt{2 π} λ} + \frac{T \sqrt{λ_{2}}}{2 π σ}) exp (\frac{- λ^{2}}{2 σ^{2}}) \end{matrix}

with $σ = \sqrt{E (| ζ (t) |^{2})}$ , $λ_{2} = E (| \dot{ζ} (t) |^{2})$ . The reader may consult [36] for generalisations of this bound inequality. The notable paper by [37] also contains results for the volume of tubes using Gaussian–Minkowski functionals.

Example 1 (Smooth random functions).

The following two plots are examples of smooth random functions with varying wavelength λ. More precisely, the left plot is a smooth random function (a sum of sines and cosines of the form (16)) with parameters $L = 8$ , $λ = 2$ ; recall that the sum has $r + 1$ terms, with $r = ⌊ L / λ ⌋$ . Thus, when $L = 8$ , $λ = 2$ , we obtain $r = 4$ terms in the sum. The right plot is an analogous sum with $r = ⌊ 8 / 0.1 ⌋ = 80$ terms (See Figure 1).

Examples of smooth random functions with parameters $L = 8$ , $λ = 2$ (**left plot**) and $L = 8$ , $λ = 0.1$ (**right plot**).

In order to offer a more concrete intuition as to the notions of state, control input, and output, let us consider a simple, although generic, example.

Example 2 (Simple generic example; model).

Consider the following simple example with scalar action (control) and sensor output:

$\begin{matrix} {\dot{x}}_{1} & = x_{2} + ζ_{x_{1}} \end{matrix}$ (17a)

$\begin{matrix} {\dot{x}}_{2} & = f (x_{1}, x_{2}) + u + ζ_{x_{2}} \end{matrix}$ (17b)

$\begin{matrix} y & = x_{1} + ζ_{y} \end{matrix}$ (17c)

where f is a smooth function: $f \in C^{\infty} (R^{2})$ . The fluctuations are taken as smooth random Gaussian functions:

$\begin{matrix} ζ_{x_{1}} (t) & = \sum_{k ⩾ 0} a_{k_{1}} ϕ_{k} (t), & a_{k_{1}} & = N (0, σ_{x}) \end{matrix}$ (18a)

$\begin{matrix} ζ_{x_{2}} (t) & = \sum_{k ⩾ 0} a_{k_{2}} ϕ_{k} (t), & a_{k_{2}} & = N (0, σ_{x}) \end{matrix}$ (18b)

$\begin{matrix} ζ_{y} (t) & = \sum_{k ⩾ 0} c_{k} ϕ_{k} (t), & c_{k} & = N (0, σ_{y}) \end{matrix}$ (18c)

$\begin{matrix} ϕ_{k} (t) & = cos (\frac{2 π j}{L} t), & r & = ⌊ L / ν ⌋ \end{matrix}$ (18d)

with $ν, L > 0$ . Here (2) are the equations of a generative model, in the sense of Definition 1, $x = (x_{1}, x_{2})$ is the state of the generative model, in the sense of Definition 4, u is the input, and y is an output, in the sense of Definition 3.

This type of model includes all models predicated upon Newton’s law of motion, such as

$\begin{matrix} M \overset{. .}{y} & = F (y, \dot{y}) + v \end{matrix}$ (19)

where M is a mass, F is a model for internal forces depending on the position y and velocity $\dot{y}$ , and v is an external force.

A more general case would be the following:

$\begin{matrix} {\dot{x}}_{1} & = f_{1} (x_{1}, x_{2}) + ζ_{x_{1}} \end{matrix}$ (20a)

$\begin{matrix} {\dot{x}}_{2} & = f_{2} (x_{1}, x_{2}, x_{3}) + ζ_{x_{2}} \end{matrix}$ (20b)

$⋮$

$\begin{matrix} {\dot{x}}_{n} & = f (x_{1}, \dots, x_{n}) + u + ζ_{x_{n}} \end{matrix}$ (20c)

$\begin{matrix} y & = x_{1} + ζ_{y} \end{matrix}$ (20d)

with $f_{i}$ s being smooth functions of their arguments, invertible with respect to the final function. All the computations made for the simple example (17) can be readily extended to the above.

3. Free Energy, Flatness, and Conceptual Similarities

3.1. Free and Expected Free Energy

One can understand both active inference- and differential flatness-informed control schemes as identifying mappings from sensation to action from models that detail the influence of action on sensation. Flatness rests upon there being an invertible mapping between action and sensation such that a desired sensory trajectory uniquely determines the actions that generate it. Active inference involves the selection of actions that bring sensations in line with the mode of a marginal density of sensory data implied by a generative model. This is mediated by reflexive actions determined by sensory data. The marginal density that identifies desired sensory trajectories is often specified in terms of an expected free energy—whose role is to determine prior plausibilities of alternative action sequences based upon their capacity to minimise the Kullback–Lieber divergence (also known as risk) between desired and anticipated sensory trajectories.

Consistent with control-theoretic formulations of the sort outlined above, active inference can be formulated as optimising a functional of a model that relates controllable variables to some observable outcomes. Specifically, it depends upon optimisation (minimisation) of a variational free energy that acts as an upper bound on the surprise or negative log marginal likelihood of those observations. Variational free energy can be formulated in several ways to quantify the performance of a system engaging in active inference in terms of energies (i.e., surprise) and divergences (i.e., relative entropies):

\begin{array}{l} F (u (t), y (t)) = \\ \underset{Energy}{\underset{︸}{- \int q (x (t) | u (t)) ln p (x (t), y (t) | u (t)) d x (t)}} + \underset{Neg - entropy}{\underset{︸}{\int q (x (t) | u (t)) ln q (x (t) | u (t)) d x (t)}} \end{array}

(21a)

\begin{matrix} = \underset{Divergence}{\underset{︸}{\int q (x (t) | u (t)) ln \frac{q (x (t) | u (t))}{p (x (t) | y (t), u (t))} d x (t)}} + \underset{Surprise}{\underset{︸}{(- ln p (y (t) | u (t)))}} \end{matrix}

(21b)

\begin{matrix} = \underset{Complexity}{\underset{︸}{\int q (x (t) | u (t)) ln \frac{q (x (t) | u (t))}{p (x (t) | u (t))} d x (t)}} - \underset{Accuracy}{\underset{︸}{\int q (x (t) | u (t)) ln p (y (t) | x (t), u (t)) d x (t)}} \end{matrix}

(21c)

The above presents variational free energy $F$ as a functional of two probability distributions (for discrete states) or densities (for continuous states). The density labelled p is that associated with our generative model, while q represents a density variously referred to as a recognition density, an approximate posterior density, or a variational density. For the purposes of this paper, the variational density is assumed to have already been optimised such that $q (x (t) | u (t)) = p (x (t) | y (t), u (t))$ . Each formulation of the free energy depends upon different factorisations of the generative model. When expressed as a joint density, the minimisation of free energy can be seen as a constrained maximum entropy problem. On factorising into conditionals and marginal likelihoods, the free energy is seen to be an upper bound on surprise —the negative log marginal likelihood, Bayesian model evidence, or improbability of an observation under a given generative model. Finally, factorising a generative model into priors and likelihoods gives us a balance between complexity—how far we must move from prior beliefs to explain observations — and the accuracy with which we can account for sensory inputs.

Further to this, one can formulate expected free energies that highlight other forms of discrepancy that are especially relevant for the optimisation of control. When formulated explicitly in a pathwise setting, we have

\begin{matrix} G (u (t)) & = - \underset{Mutual information}{\underset{︸}{\int \int q (x (t), y (t) | u (t)) ln \frac{q (x (t), y (t) | u (t))}{p (x (t), y (t) | u (t))} d x (t) d y (t)}} - \end{matrix}

\begin{matrix} \underset{Expected value}{\underset{︸}{\int q (y (t) | u (t)) ln p (y (t) | y_{r} (t)) d y (t)}} \\ = \underset{Risk}{\underset{︸}{\int q (y (t) | u (t)) ln \frac{q (y (t) | u (t))}{p (y (t) | y_{r} (t))} d y (t)}} + \end{matrix}

(22a)

\begin{matrix} \underset{Ambiguity}{\underset{︸}{(- \int \int q (y (t), x (t) | u (t)) ln p (y (t) | x (t), u (t)) d x (t) d y (t))}} \end{matrix}

(22b)

The expected free energy is typically used for planning, where we might integrate this quantity along future paths and assign paths of control states higher probabilities for lower expected free energies. Of particular relevance for the discussion that follows is the idea that, by placing priors over the path of observations we wish to obtain (here indicated by conditioning upon a goal $y_{r}$ ), optimisation of expected free energy involves determining the set of control paths that would realise these outcomes.

We can now identify at least three different aspects of optimality, namely surprise, inadequacy, and discrepancy:

The discrepancy in observation (inaccuracy) in gathering information from the world. This discrepancy is implicit in the (negative) accuracy term of the variational free energy, which under Gaussian assumptions will reduce to the square of a prediction error: $ε_{o b s} = y - \hat{y}$ , where $\hat{y}$ is the predicted observation under beliefs about $x$ . This is the (sensory) prediction error that features in predictive coding or Bayesian filtering, linear quadratic control, and model predictive control.
The discrepancy in action in the way the agent acts in the world. This discrepancy is between the goal $y_{r} (t)$ , a predefined trajectory to be followed, and the actual position $y (t)$ : $ε_{a c t} = y - y_{r}$ . This is implicit in the risk term of the expected free energy, which quantifies the divergence between the distribution anticipated under a set of control states and the distribution anticipated given the goal.
The discrepancy in modelling in the way the agent represents the world internally. This discrepancy would normally be quantified using the marginal likelihood of sensory observations under the model and is reflected in the surprise term of the variational free energy. When free energy is minimal with respect to q, it becomes a tight bound on this discrepancy. For this reason, variational free energy is often used as a tractable method of approximating Bayes factors to compare alternative models in statistical inference.

The free energies above ( $F$ and $G$ ) can be seen as so-called global Lyapunov functions (see, e.g., [38]) that capture these aspects seen in optimal control (see, e.g., [39,40]). In what follows, we shall mostly be interested in $G$ , and specifically the risk term, leaving the other aspects to future works. More precisely, we are interested in whether the notion of differential flatness coheres with the selection of generative models that optimise $F$ and $G$ . The derivation of trajectory tracking action laws on differentially flat models will be seen to minimise the above risk in $G$ (see Section 4.6).

Table 1 summarises the notations used so far, where bold symbols are vectors.

Table 1.

Notation table.

Variable	Meaning
$x$	Hidden state vector
$y$	Output vector (e.g., sensor data)
$u$	Action vector
$ζ_{x}$	Fluctuations in hidden state dynamics
$ζ_{y}$	Fluctuations in sensor data
$F$	Variational free energy
$G$	Expected free energy

Open in a new tab

3.2. Differential Flatness

3.2.1. Controllability

A ubiquitous notion—when one wishes to steer a system—is the global controllability, as stated in the following definition (see, e.g., [41]).

Definition 7.

A system such as (1), but idealised such that we assume the fluctuations are infinitely precise,

$\begin{matrix} \dot{x} & = f_{m} (x, u) \end{matrix}$ (23a)

$\begin{matrix} y & = h_{m} (x) \end{matrix}$ (23b)

is said to be globally controllable if, for any time instants $t_{0}$ and $t_{1}$ and initial and final states $x_{0}$ and $x_{1}$ , there exists an action (a control law) $u$ in a space of admissible controls steering the system from $x (t_{0}) = x_{0}$ to $x (t_{1}) = x_{1}$ .

The reader may note that this definition is purely descriptive: it does not contain any constructive procedure for steering the system; it is not possible to infer, from reading the definition alone, the form of the control law to be applied to go from $x_{0}$ to $x_{1}$ . This is a definition of the existence-of-a-solution kind, not of a solution construction to the given problem.

The reader may note that this definition is purely descriptive, in that it does not contain any constructive procedure for steering the system. Moreover it is pointwise in spirit, rather than pathwise. Indeed, this definition does not say anything about the path that links the initial to the final state. This path can be implausible and still fulfil the controllability requirements. We shall see in the next subsection a stronger and, in our sense, more useful property, namely, differential flatness.

3.2.2. Motivation Through Observation and Action

The premise of active inference is that an agent seeks to minimise the surprise or divergence between its beliefs or expectations about the surrounding environment and the actual state it experiences. This minimisation can, in principle, be enforced through effective information gathering and action or be engrained in the very structure of the agent’s model (such as being refined through learning and evolution).

We will examine a case where the requisite efficiency is encoded in the agent’s model structure itself. More precisely the imperatives for perception and action are directly fulfilled through the following:

(Odsf)
Observation discrepancy structural fulfilment. The state $x$ can be recovered through what the agent is able to know directly (i.e., without any inference, reflection, or computation), that is $y$ , $u$ , and their time derivatives. This amounts to the system being constructively observable.
(Adsf)
Action discrepancy structural fulfilment. The link between the goal $y_{μ r}$ and the action $u$ —required to reach that goal—is direct, in that the action is given as a function of the goal and its time derivatives. This amounts to the system being left-invertible.

A system such as (23) is said to be left-invertible with respect to $z$ if the action $u$ is a function of $z$ and its derivatives (see Property 5 of [42]). Thus, for a dynamical system to fulfil both (Odsf) and (Adsf), one needs to have a function $ω$ such that both the state $x$ and the action $u$ can be expressed in terms of $ω$ and its time derivatives. This corresponds to differential flatness [5,43,44], a property shared by a great number of practical dynamical systems (see, e.g., [45] and the references therein).

3.2.3. Motivation Through Direct and Inverse Views

The fundamental property of flat systems is that all their solutions can be functionally parameterised by a finite number of functions and their time derivatives. Although flatness is a relatively recent notion, introduced in control theory in the 1990s [5,43,44], it actually has a long history: a similar notion— of systems of undetermined differential equations integrable without integration—dates back to Hilbert [46] and Cartan [47]. Indeed, the control system $\dot{x} = f (x, u)$ can be seen as an underdetermined differential system consisting of n equations ${\dot{x}}_{i} = f_{i} (x, u)$ , for $i = 1, \dots, n$ , and $n + m$ variables (n states and m action variables). The difference between the number of variables and the number of equations gives the degrees of freedom number of the system. It follows that m functions can be chosen freely. In the context of control systems, one usually chooses freely the input $u (t) = (u_{1} (t), \dots, u_{m} (t))$ and then integrates in order to compute the state $x (t)$ . But is this the only way to achieve this?

In order to answer this question, consider the following simple single-input system:

\begin{matrix} {\dot{x}}_{1} & = x_{2} \end{matrix}

(24a)

\begin{matrix} {\dot{x}}_{2} & = u \end{matrix}

(24b)

We can freely choose $u (t)$ , then integrate it once to compute $x_{2} (t)$ , and then integrate it a second time to compute $x_{1} (t)$ . Let us now choose freely $x_{1} (t)$ , differentiate it once to get $x_{2} (t)$ , and then differentiate $x_{2} (t)$ to obtain $u (t)$ . It follows that the system with Equation (24) admits two functional parameterisations: one via the input, for which we have to integrate twice (which is, in the general case of $\dot{x} = f (x, u)$ , difficult and, sometimes, impossible analytically), and one via the state $x_{1}$ , for which we have to differentiate twice (which is always possible in a straightforward way). Hence, there are underdetermined differential systems (also known as control systems) that are solvable without integration, namely, differentially flat systems.

3.2.4. Formal Definition

Let us consider the central definition of this subsection, stated for systems with specified parameters, for ease of reading (see, e.g., [15,42,45]; see also the Supplementary Material, File AIandFP-FlatnessAndSRR-HMTPKF-2026-SupplMaterial-v1.pdf, Section D and [48] for a Python library, version 0.10.2).

Definition 8 (Differential flatness).

Consider a model with variables $z (t) = (z_{1} (t), \dots,$ $z_{r} (t)) \in$ $R^{r}$ , fluctuations $ζ (t) = (ζ_{1} (t), \dots,$ $ζ_{s} (t)) \in R^{s}$ , and equations

$\begin{matrix} Φ (z, \dot{z}, \dots, z^{(ρ_{z})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$ (25)

The model is called differentially flat if there exists an m-tuple of variables, $ω (t) = (ω_{1} (t),$ $\dots,$ $ω_{m} (t))$ , named flat outputs, with the following three properties.

Endogenous character. We have (locally and generically)
$\begin{matrix} ω & = h (z, \dot{z}, \dots, z^{(η_{z})}) η_{z} \in N \end{matrix}$ (26)

In other words, the components of the flat output are combinations of the system’s variables.

Functional parameterisation.

The system’s variables $z$ can be (locally and generically) expressed through the flat output and a finite number of its derivatives:
$\begin{matrix} z & = A_{z} (ω, \dot{ω}, \dots, ω^{(α_{z})}, ζ, \dot{ζ}, \dots, ζ^{(α_{ζ})}) \end{matrix}$ (27)
with $α_{z}, α_{ζ}$ integers, such that the system’s equations
$\begin{matrix} Φ (A_{z} {\dot{A}}_{z}, \dots, A_{z}^{(ρ_{z})}, ζ, \dot{ζ}, \dots, ζ^{(ρ_{ζ})}) & = 0 \end{matrix}$
are identically verified.

Differential independence. The components of the flat output are differentially independent; i.e., any differential relation
$\begin{matrix} Ξ (ω, \dot{ω}, \dots, ω^{(β)}) & = 0 \end{matrix}$
is necessarily trivial: $Ξ \equiv 0$ .

Remark 3 (Differential flatness–state-space form case).

In case the model has the following form:

$\begin{matrix} \dot{x} & = f (x, u) + ζ \end{matrix}$ (28)

with state $x (t) \in R^{n}$ , action $u (t) \in R^{m}$ , and fluctuation $ζ (t) \in R^{n}$ , the functional parametrisation (27) becomes

$\begin{matrix} x & = A (ω, \dot{ω}, \dots, ω^{(α_{x})}, ζ, \dot{ζ}, \dots, ζ^{(α_{x})}) \end{matrix}$ (29a)

$\begin{matrix} u & = B (ω, \dot{ω}, \dots, ω^{(α_{u})}, ζ, \dot{ζ}, \dots, ζ^{(α_{u})}) \end{matrix}$ (29b)

Remark 4.

Note that the conventional definition of differential flatness is made for determinsitic (mean) models:

$\begin{matrix} \dot{x} & = f (x, u) \end{matrix}$ (30)

with state $x (t) \in R^{n}$ and action $u (t) \in R^{m}$ , in which case the functional parametrisation (27) becomes

$\begin{matrix} x & = A (ω, \dot{ω}, \dots, ω^{(α_{x})}) \end{matrix}$ (31a)

$\begin{matrix} u & = B (ω, \dot{ω}, \dots, ω^{(α_{u})}) \end{matrix}$ (31b)

The fluctuations considered above in (29a) and (29b) are not necessarily supposed to be known functions of time, but rather are seen as generic, sufficiently differentiable functions. The relations (29) are valid whatever the fluctuations ζ may be at this structural level. See Section 3.2.5 below for a discussion on this functional parametrisation.

Remark 5 (Local genericity).

As in Remark 1, the relations of endogenous character in (26) and of functional parameterisation in (27), naturally yield implicit functional relations:

$\begin{matrix} H (ω, z, \dot{z}, \dots, z^{(η_{z})}) & = 0 \end{matrix}$ (32a)

$\begin{matrix} Λ_{z} (z, ω, \dot{ω}, \dots, ω^{(α_{z})}, ζ, \dot{ζ}, \dots, ζ^{(α_{ζ})}) & = 0 \end{matrix}$ (32b)

And, locally, in the neighbourhood of generic regular points (i.e., points where the Jacobian of H with respect to ω and $Λ_{z}$ with respect to $z$ is regular), we can solve (32a) for ω and (32b) for $z$ using the implicit function theorem.

Let us note that the differential flatness definition is made in the presence of fluctuations. The latter may be deterministic or smooth stochastic functions (i.e., differentiable up a certain order). We will consider relations furnishing, in particular, the action as a functional of the sensory and fluctuation paths. This enables one to study precisely and quantitatively the influence that each perturbation function may exert on the action. Moreover the relation yielding the action—respectively, the state—as a function of sensor and fluctuation paths is generic, in the sense that it is valid for any (sufficiently smooth) sensory and fluctuation paths. In this sense, the notion of differential flatness is agnostic with respect to the stochasticity of the generative model. The latter may be deterministic (as is the case for mean generative models), subject to deterministic, but unkown, perturbations or subject to stochastic (and sufficiently smooth) fluctuations. The previous definition readily affords the following characterisation:

Proposition 1.

A differentially flat system in state-space form is a system like (28), i.e., $\dot{x} = f (x, u)$ , which is observable and left invertible with respect to ω.

To summarise what has been said so far, in order to express what we think we know about the system (our beliefs) and what we wish for the system (our expectations), we require a model structure that expresses first the agent system’s constructive observability (what we know or believe) and second the agent system’s left invertibility (what we expect and where the action $u$ is expressed in terms of the flat output goal, $ω_{r}$ ). The latter amounts to differential flatness, which is no more than the generator character of the flat output, i.e., the functional parameterisation and the differential polynomial independence of the flat output components. The last item enables one to choose the various components $ω_{i r} (t)$ of $ω_{r} (t)$ independently of each other. Both properties are reminiscent of the basis notion in vector spaces (i.e., the minimally generating and maximally independent characters); indeed, in a differential algebraic setting, the notion of flat output corresponds to a differential transcendence basis (see [5]).

An interesting point of contact with active inference is that optimisation of expected free energy functionals implies a high degree of mutual information between different components of a model (specifically, between states and observations). This implies precise mappings from actions, via states, to observations. Crucially, that same mutual information means that we would expect observations to be highly informative about states and, possibly, actions. The potential to recover subsets of variables from others in such models is heuristically compatible with the differential flatness concept. Furthermore, the need for differential independence has an interesting link with the interpretation of variational free energy as an objective function for constrained maximum entropy inference—where in the absence of the ‘energy’ constraints, the best configuration is that with maximum entropy without mutual constraints between the components of a system.

Remark 6 (Tracking an arbitrary output).

Suppose that the agent’s model is differentially flat. The agent’s goal $y_{r}$ may coincide with the flat output goal: $ω_{r} = y_{r}$ . If this is not the case, consider the expression of the agent’s output $y$ as a function of the flat output ω and its derivatives:

$\begin{matrix} y & = A_{y} (ω, \dot{ω}, \dots, ω^{(α_{y})}, ζ, \dot{ζ}, \dots, ζ^{(α_{ζ})}) \end{matrix}$ (33)

and view this expression as a differential equation in ω:

$\begin{matrix} ω^{(α_{y})} & = {\tilde{A}}_{y} (ω, \dot{ω}, \dots, ω^{(α_{y} - 1)}, y, ζ, \dot{ζ}, \dots, ζ^{(α_{ζ})}) \end{matrix}$ (34)

Substituting $y$ by the goal $y_{r}$ yields the flat output goal $ω_{r}$ as a solution of

$\begin{matrix} ω_{r}^{(α_{y})} & = {\tilde{A}}_{y} (ω_{r}, {\dot{ω}}_{r}, \dots, ω_{r}^{(α_{y} - 1)}, y_{r}, ζ, \dot{ζ}, \dots, ζ^{(α_{ζ})}) \end{matrix}$ (35)

3.2.5. Functional Parameterisation

The functional parameterisation property is an essential, if not by far the most essential feature, of differential flatness. Indeed, the original model

\begin{matrix} \dot{x} & = f (x, u) + ζ_{x} \end{matrix}

(36a)

\begin{matrix} y & = h (x) + ζ_{y} \end{matrix}

(36b)

is totally equivalent to its functional parametric form:

\begin{matrix} x & = A_{m} (ω, \dot{ω}, \dots, ω^{(ρ_{x})}, \dot{ζ}, \dots, ζ^{(ρ_{x})}) \end{matrix}

(37a)

\begin{matrix} u & = B_{m} (ω, \dot{ω}, \dots, ω^{(ρ_{u})}, \dot{ζ}, \dots, ζ^{(ρ_{u})}) \end{matrix}

(37b)

\begin{matrix} ζ & = (ζ_{x}, ζ_{y}) \end{matrix}

(37c)

And, crucially, the functional parametric form (37) is quasi-static, whereas the original model (36) has a dynamical form. Hence, when one wishes to deal with (36), one is naturally tempted to seek the solution of the ODE system (36a), and most often, an analytical solution cannot be found. But when dealing with the functional parametric form (37), all system variables, i.e., $x$ and $u$ here, are parametrised by the function $ω$ , since once we know the function $ω$ , then we readily know the functions $x$ and $u$ through (37) (assuming, in a structural step such as this one, that the fluctuations $ζ$ are known or measured).

Note that this parametrisation is a pathwise one and that it carries over the whole fluctuation; it can thus be envisioned either as a tube around the mean (see, e.g., [27,28,36]) or as a function sheaf (see, e.g., [49,50]; see also [51] for an elementary introduction and [52] for more complete while still accessible references).

Thus, the action, the control $u$ , and the state $x$ are envisioned as functionals of the flat output and of the fluctuations, i.e., as functions over spaces of functions:

\begin{matrix} x : F^{m} & \times F^{n + p} ⟶ F^{n} \\ ( & ω, ζ) ⟶ A_{m} (ω, \dot{ω}, \dots, ω^{(ρ_{x})}, \dot{ζ}, \dots, ζ^{(ρ_{x})}) \\ u : F^{m} & \times F^{n + p} ⟶ F^{m} \end{matrix}

(38a)

\begin{matrix} ( & ω, ζ) ⟶ B_{m} (ω, \dot{ω}, \dots, ω^{(ρ_{u})}, \dot{ζ}, \dots, ζ^{(ρ_{u})}) \\ ζ & = (ζ_{x}, ζ_{y}), F \subseteq C^{ρ} (R), ρ = \max (ρ_{x}, ρ_{u}) \end{matrix}

(38b)

And it is the study of these functionals that may be of great interest to the active inference community.

3.3. Conceptual Similarities

We can now see—with a simple concrete example—that both differential flatness and free energy minimisation enforce an inverse mapping from the sensed output to the action, namely, the control input.

Example 3 (Simple generic example; similarities).

Recall the following simple example with scalar action (control) and sensor output:

$\begin{matrix} {\dot{x}}_{1} & = x_{2} + ζ_{x_{1}} \end{matrix}$ (39a)

$\begin{matrix} {\dot{x}}_{2} & = f (x_{1}, x_{2}) + u + ζ_{x_{2}} \end{matrix}$ (39b)

$\begin{matrix} y & = x_{1} + ζ_{y} \end{matrix}$ (39c)

This model is differentially flat with flat output y. Indeed, the functional parameterisation is obtained as follows. Equation (39c) yields

$\begin{matrix} x_{1} & = y - ζ_{y} \end{matrix}$ (40)

Equation (39a) yields

$\begin{matrix} x_{2} & = {\dot{x}}_{1} - ζ_{x_{1}} = \dot{y} - {\dot{ζ}}_{y} - ζ_{x_{1}} \end{matrix}$ (41)

Then, using Equation (39b),

$\begin{matrix} u & = \overset{. .}{y} - f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y} - ζ_{x_{1}}) - {\overset{. .}{ζ}}_{y} - {\dot{ζ}}_{x_{1}} - ζ_{x_{2}} \end{matrix}$ (42)

Thus, the functional parametrisation associated with the differential flatness property is

$\begin{matrix} x_{1} & = y - ζ_{y} \end{matrix}$ (43a)

$\begin{matrix} x_{2} & = \dot{y} - {\dot{ζ}}_{y} - ζ_{x_{1}} \end{matrix}$ (43b)

$\begin{matrix} u & = \overset{. .}{y} - f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y} - ζ_{x_{1}}) - {\overset{. .}{ζ}}_{y} - {\dot{ζ}}_{x_{1}} - ζ_{x_{2}} \end{matrix}$ (43c)

We see in the last equation the inverse mapping from the sensory output $y (y)$ to the control input $u (t)$ . Note that this relation embeds all the influences of fluctuations on the action, since it is a functional in y $ζ_{x_{1}}$ , $ζ_{x_{2}}$ , and $ζ_{y}$ .

In the case of a deterministic system,

$\begin{matrix} {\dot{x}}_{1} & = x_{2} \end{matrix}$ (44a)

$\begin{matrix} {\dot{x}}_{2} & = f (x_{1}, x_{2}) + u \end{matrix}$ (44b)

$\begin{matrix} y & = x_{1} \end{matrix}$ (44c)

The parametrisation becomes

$\begin{matrix} x_{1} & = y \end{matrix}$ (45a)

$\begin{matrix} x_{2} & = \dot{y} \end{matrix}$ (45b)

$\begin{matrix} u & = \overset{. .}{y} - f (y, \dot{y}) \end{matrix}$ (45c)

Note that (44) is dynamically equivalent to (45).

Then, minimising the free energy $F$ enforces the dynamical mode (assuming it is possible to set all terms of this free energy to zero, at least approximately), i.e., the dynamical model obtained without fluctuations:

$\begin{matrix} {\dot{μ}}_{x_{1}} & = μ_{x_{2}} \end{matrix}$ (46a)

$\begin{matrix} {\dot{μ}}_{x_{2}} & = f (μ_{x_{1}}, μ_{x_{2}}) + u \end{matrix}$ (46b)

$\begin{matrix} y & = μ_{x_{1}} \end{matrix}$ (46c)

where $μ_{*}$ , $* \in {x_{1}, x_{2}, u}$ , is the mode of ∗ under the recognition density. By virtue of the previous equivalence, (46) is dynamically equivalent to

$\begin{matrix} μ_{x_{1}} & = y \end{matrix}$ (47a)

$\begin{matrix} μ_{x_{2}} & = {\dot{μ}}_{y} \end{matrix}$ (47b)

$\begin{matrix} u & = \overset{. .}{y} - f (y, \dot{y}) \end{matrix}$ (47c)

We now see that the minimisation of free energy enforces an (at least approximate) inverse mapping from the sensed output to the action. There is a further relationship that we will return to later, predicated upon generalised coordinates of motion, which we have not addressed here and which relates to the final equation above. As the action influences the rate of change of the states, which then influence the sensory data, the gradient of current sensory data with respect to the action is zero. This can be seen explicitly by noting that there is no term in the free energy in which sensory data and the mode of the action jointly appear. However, as highlighted here, there can be a non-zero gradient associated with temporal derivatives of the sensory data (i.e., higher orders of generalised coordinates of motion). This is essential for the reflexive formulation of action under active inference.

4. References, Flatness-Based Trajectory Tracking, and Perceptual and Active Inferences

4.1. Equivalence to Linearity

4.1.1. Differential Flatness Characterisation

The class of differentially flat systems is—despite the fact that it occurs quite frequently in practice—the simplest nonlinear class with respect to the feedback equivalence classes. Indeed, we have the following.

Proposition 2.

A system is flat if, and only if, it is linearisable by endogenous feedback and a change of coordinates.

A dynamic feedback is called endogenous if it does not include any external dynamics. More precisely, the following holds.

Definition 9.

Consider the dynamics $\dot{x} = f (x, u) + ζ$ . The feedback

$\begin{matrix} u & = φ (x, ξ, ζ, \dot{ζ}, \dots, ζ^{(σ_{u})}, v) \end{matrix}$ (48a)

$\begin{matrix} \dot{ξ} & = ψ (x, ξ, ζ, \dot{ζ}, \dots, ζ^{(σ_{v})}, v) \end{matrix}$ (48b)

(where $v$ is the new input) is called a dynamic endogenous feedback if the original dynamics $\dot{x} = f (x, u) + ζ$ is equivalent to the transformed one

$\begin{matrix} \dot{x} & = f (x, φ (x, ξ, ζ, \dot{ζ}, \dots, ζ^{(σ_{u})}, v)) \end{matrix}$ (49a)

$\begin{matrix} \dot{ξ} & = ψ (x, ξ, ζ, \dot{ζ}, \dots, ζ^{(σ_{v})}, v) \end{matrix}$ (49b)

Two systems are called equivalent if there exists an invertible transformation that exchanges their trajectories.

Crucially the preceding linearisation is not local but global, and the category of flat systems is not very far from the one of linear systems, since they are equivalent through endogenous feedback and coordinate change.

4.1.2. Dynamical Extension Algorithm

This procedure enables one to determine if an m-tuple $ω = (ω_{1}, \dots, ω_{m})$ is a flat output or not and to obtain a linearizing feedback.

Phase I—Weak Brunovský Index Gathering

(1)
Differentiate $ω_{1}$ until a combination of control inputs appears; denote by $κ_{1}$ the number of successive differentiations:
$\begin{matrix} ω_{1}^{(κ_{1})} & = γ_{1} (ω, \dot{ω}, \dots, ω^{(κ_{1} - 1)}, u, \dot{u}, \dots, u^{(σ_{u 1})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 1})}) \end{matrix}$
with $σ_{u 1} ⩽ κ_{1} - 1$ and $σ_{ζ 1} ⩽ κ_{1}$ .
(2)
Differentiate $ω_{2}$ until a combination of control inputs (independent of the previous ones) appears; denote by $κ_{2}$ the number of successive differentiations:
$\begin{matrix} ω_{2}^{(κ_{2})} & = γ_{2} (ω, \dot{ω}, \dots, ω^{(κ_{2} - 1)}, u, \dot{u}, \dots, u^{(σ_{u 2})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 2})}) \end{matrix}$
with $σ_{u 2} ⩽ κ_{2} - 1$ and $σ_{ζ 2} ⩽ κ_{2}$ .

⋮
(m)
Differentiate $ω_{m}$ until a combination of control inputs (independent of the previous ones) appears; denote by $κ_{m}$ the number of successive differentiations:
$\begin{matrix} ω_{m}^{(κ_{m})} & = γ_{m} (ω, \dot{ω}, \dots, ω^{(κ_{m} - 1)}, u, \dot{u}, \dots, u^{(σ_{u m})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ m})}) \end{matrix}$
with $σ_{u m} ⩽ κ_{m} - 1$ and $σ_{ζ m} ⩽ κ_{m}$ .

Phase II—Flatness Character Determination

Then, if $κ_{1} + \dots + κ_{m} = n$ (n being the state dimension), the system admits $ω$ as a flat output. If not, $ω$ is not a flat output.

Phase III—Linearizing Feedback

In case $ω$ is a flat output, the linearizing feedback is given by

\begin{matrix} γ_{1} (ω, \dot{ω}, \dots, ω^{(κ_{1} - 1)}, u, \dot{u}, \dots, u^{(σ_{u 1})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 1})}) & = v_{1} \end{matrix}

(50a)

\begin{matrix} γ_{2} (ω, \dot{ω}, \dots, ω^{(κ_{2} - 1)}, u, \dot{u}, \dots, u^{(σ_{u 2})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 2})}) & = v_{2} \end{matrix}

(50b)

⋮

\begin{matrix} γ_{m} (ω, \dot{ω}, \dots, ω^{(κ_{m} - 1)}, u, \dot{u}, \dots, u^{(σ_{u m})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ m})}) & = v_{m} \end{matrix}

(50c)

where $v_{1}, \dots, v_{m}$ are the new action (new control input) variables. Since in Phase I, the $γ_{i}$ s ( $i = 1, \dots, m$ ) are functionally independent, the latter Equations (50) is invertible in $u_{1},$ $\dots, u_{m}$ :

\begin{matrix} u_{1} = & {\tilde{γ}}_{1} (ω, \dot{ω}, \dots, ω^{(σ_{{\tilde{κ}}_{1}})}, v, \dot{v}, \dots, v^{(σ_{v 1})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 1})}) \end{matrix}

(51a)

\begin{matrix} u_{2} = & {\tilde{γ}}_{2} (ω, \dot{ω}, \dots, ω^{(σ_{{\tilde{κ}}_{2}})}, v, \dot{v}, \dots, v^{(σ_{v 2})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 2})}) \end{matrix}

(51b)

⋮

\begin{matrix} u_{m} = & {\tilde{γ}}_{m} (ω, \dot{ω}, \dots, ω^{(σ_{{\tilde{κ}}_{m}})}, v, \dot{v}, \dots, v^{(σ_{v m})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ m})}) \end{matrix}

(51c)

with $v = (v_{1}, \dots, v_{m})$ .

The original dynamics $\dot{x} = f (x, u) + ζ$ is then transformed, via the linearising endogenous feedback (50), to a linear dynamics of the form

\begin{matrix} ω_{1}^{(κ_{1})} & = v_{1} \end{matrix}

(52a)

⋮

\begin{matrix} ω_{m}^{(κ_{m})} & = v_{m} \end{matrix}

(52b)

with the new input $v = (v_{1}, \dots, v_{m})$ .

Thus, the agent’s model (36) is equivalent, through the feedback (50), to the so-called flat output dynamics:

\begin{matrix} ω_{1}^{(κ_{1})} & = γ_{1} (ω, \dots, ω^{(κ_{1} - 1)}, u, \dots, u^{(σ_{u 1})}, ζ, \dots, ζ^{(σ_{ζ 1})}) \end{matrix}

(53a)

⋮

\begin{matrix} ω_{m}^{(κ_{m})} & = γ_{m} (ω, \dots, ω^{(κ_{m} - 1)}, u, \dots, u^{(σ_{u m})}, ζ, \dots, ζ^{(σ_{ζ m})}) \end{matrix}

(53b)

Thus, the original model (36) with n equations has been reduced, exactly, i.e., without any approximation, to the flat output dynamics (53) with m equations, where m is, in most practical cases, significantly smaller than n.

4.2. Differential Flatness and Controllability

A natural question a reader may ask is as follows: when is the differential flatness property verifiable? In other words, what are the checkable conditions ensuring flatness for a given system? The answer, for general nonlinear systems, is still an open problem. There are some conditions for restricted system classes or feedback equivalence classes. For instance, conditions are known for single-input systems and for static state feedback equivalence (see the Supplementary Material, File AIandFP-FlatnessAndSRR-HMTPKF-2026-SupplMaterial-v1.pdf, Section C).

There are some simple classes that are trivially flat. In the case of single-input systems, the class of systems that are, up to a feedback equivalence, in a cascade form like the following,

\begin{matrix} {\dot{x}}_{1} & = f_{1} (x_{1}, x_{2}) \end{matrix}

(54a)

\begin{matrix} {\dot{x}}_{2} & = f_{2} (x_{1}, x_{2}, x_{3}) \end{matrix}

(54b)

⋮

\begin{matrix} {\dot{x}}_{n} & = f_{n} (x_{1}, \dots, x_{n}, u) \end{matrix}

(54c)

with the functions $f_{i}$ being invertible in their last argument, are differentially flat, with $x_{1}$ as a flat output. The systems that are, up to a feedback equivalence, in a form with a finite number m of blocks like (54) are also differentially flat. We shall nevertheless see that differential flatness is a very strong property, as the following proposition suggests.

Proposition 3.

A differentially flat system is globally controllable.

Proof.

The global controllability property (see Definition 7) amounts to the following: for any initial and final states $x_{0}$ and $x_{1}$ of $R^{n}$ , and any initial and final times $t_{0}$ and $t_{1}$ , there exists a control law $u \in F_{u}$ , in the so-called space of admissible controls $F_{u}$ , driving the model from the initial state $x (t_{0}) = x_{0}$ to the final state $x (t_{1}) = x_{1}$ . Consider now a flat system with equations

$\begin{matrix} \dot{x} & = f (x, u) \\ y & = h (x) \end{matrix}$

with flat output $ω$ . We thus have, by virtue of the flatness property, the following functional parametrisation:

$\begin{matrix} x & = A (ω, \dot{ω}, \dots, ω^{(ρ_{x})}) \\ u & = B (ω, \dot{ω}, \dots, ω^{(ρ_{u})}) \end{matrix}$

Consider two arbitrary states $x_{0}$ and $x_{1}$ in $R^{n}$ , and $t_{0}, t_{1} > 0$ . Let $ω_{r} (t)$ be a polynomial such that

$\begin{matrix} x (t_{0}) & = A (ω_{r} (t_{0}), {\dot{ω}}_{r} (t_{0}), \dots, ω_{r}^{(ρ_{x})} (t_{0})) = x_{0} \\ x (t_{1}) & = A (ω_{r} (t_{1}), {\dot{ω}}_{r} (t_{1}), \dots, ω_{r}^{(ρ_{x})} (t_{1})) = x_{1} \end{matrix}$

The control steering the system from $ω_{0}$ to $ω_{1}$ is

$\begin{matrix} u_{r} & = B (ω_{r}, {\dot{ω}}_{r}, \dots, ω_{r}^{(ρ_{u})}) \end{matrix}$

Thus the system is globally controllable. □

Example 4 (Oculomotor dynamics and flatness).

As a simple example, consider a model of oculomotor control, a problem that has been addressed extensively through active inference: see, e.g., [53,54,55,56,57]. Here, we will consider a simplified kinematics and dynamics of eye movement. Consider a human agent whose eyes are located at a fixed distance d from a vertical visual scene. Let us concentrate of the movement of a single eye and denote the pupil’s center by E. The eye tracks a point P with coordinates $X, Y$ in the visual scene’s fixed reference frame. The agent’s head is supposed to be fixed. Let $P_{0}$ , with coordinates $X_{0}, Y_{0}$ , be the orthogonal projection of E on the visual scene plane. We shall first describe the generative model and then demonstrate that this is differentially flat. We then consider how it might be linearised as outlined above.

According to Listing’s law [58], two angles characterise the movement of the pupil: ψ, the orientation, or yaw angle, and ϕ, the elevation, or pitch angle. Let us note that this model is extremely similar to the one of a two-degree-of-freedom gimbal system (see, e.g., [59]). We then have

$\begin{matrix} X & = X_{0} + d tan ψ + ζ_{X} \end{matrix}$ (55a)

$\begin{matrix} Y & = Y_{0} + d tan ϕ + ζ_{Y} \end{matrix}$ (55b)

Let us set

$\begin{matrix} v_{ψ} & = \dot{ψ}, v_{ϕ} = \dot{ϕ} \end{matrix}$

The differentiation of the expressions of X and X in (55) with respect to time yields (See Figure 2)

$\begin{matrix} \dot{X} & = d v_{ψ} (1 + {tan}^{2} ψ) + {\dot{ζ}}_{X} \end{matrix}$ (56a)

$\begin{matrix} \dot{Y} & = d v_{ϕ} (1 + {tan}^{2} ϕ) + {\dot{ζ}}_{Y} \end{matrix}$ (56b)

Then, using the expressions of X and Y in (55), we get expressions of $1 + {tan}^{2} ψ$ and $1 + {tan}^{2} ϕ$ as functions of X and Y:

$\begin{matrix} d (1 + {tan}^{2} ψ) & = \frac{d^{2} + {(X - X_{0} - ζ_{X})}^{2}}{d} d (1 + {tan}^{2} ϕ) = \frac{d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2}}{d} \end{matrix}$

Thus, the relations expressing $\dot{X}$ and $\dot{Y}$ in (56) become

$\begin{matrix} \dot{X} & = v_{ψ} \frac{d^{2} + {(X - X_{0} - ζ_{X})}^{2}}{d} + {\dot{ζ}}_{X} \end{matrix}$ (57a)

$\begin{matrix} \dot{Y} & = v_{ϕ} \frac{d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2}}{d} + {\dot{ζ}}_{Y} \end{matrix}$ (57b)

The relationship between $v_{ψ}, v_{ϕ}$ and the torques exerted by the eye muscles can be written as follows:

$\begin{matrix} 2 I_{e} {\dot{v}}_{ψ} & = u_{ψ} \end{matrix}$ (58a)

$\begin{matrix} I_{e} {\dot{v}}_{ϕ} & = u_{ϕ} \end{matrix}$ (58b)

with $I_{e}$ being the rotational inertia of the eye, supposed to be spherical (see the Supplementary Material, Section A, File AIandFP-FlatnessAndSRR-HMTPKF-2026-SupplMaterial-v1.pdf, for the reason why the inertia is doubled in the yaw equation above). Let us set

$\begin{matrix} x & = (x_{X}, x_{Y}, x_{ψ}, x_{ϕ}), x_{X} = X, x_{Y} = Y, x_{v_{ψ}} = ψ, x_{v_{ϕ}} = ϕ \end{matrix}$

The full generative model will then be written as follows:

$\begin{matrix} {\dot{x}}_{X} = x_{v_{ψ}} \frac{d^{2} + {(x_{X} - X_{0} - ζ_{X})}^{2}}{d} + {\dot{ζ}}_{X} \end{matrix}$ (59a)

$\begin{matrix} {\dot{x}}_{Y} = x_{v_{ϕ}} \frac{d^{2} + {(x_{Y} - Y_{0} - ζ_{Y})}^{2}}{d} + {\dot{ζ}}_{Y} \end{matrix}$ (59b)

$\begin{matrix} 2 I_{e} & {\dot{x}}_{v_{ψ}} = u_{ψ} + ζ_{ψ} \end{matrix}$ (59c)

$\begin{matrix} I_{e} & {\dot{x}}_{v_{ϕ}} = u_{ϕ} + ζ_{ϕ} \end{matrix}$ (59d)

where $ζ_{*} (t)$ , $* \in {x, y, ψ, ϕ}$ , are fluctuations, which may be due to various alterations in human vision and the eye inertia $I_{e} = 2 m_{e} r_{e}^{2} / 5$ , with $m_{e}$ and $r_{e}$ being the eye’s mass and radius.

This model is differentially flat, with $ω = (x, y)$ as a flat output. Indeed, one has the following:

From the first two lines of (59), we get the expressions of $x_{v_{ψ}}$ and $x_{v_{ϕ}}$ :
$\begin{matrix} x_{v_{ψ}} & = \frac{d ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d^{2} + {(x_{X} - X_{0} - ζ_{X})}^{2}} \end{matrix}$ (60a)

$\begin{matrix} x_{v_{ϕ}} & = \frac{d ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d^{2} + {(x_{X} - Y_{0} - ζ_{Y})}^{2}} \end{matrix}$ (60b)

From the final two lines of (59), we get the expressions of the action, the control inputs $u_{ψ}$ and $u_{ϕ}$ :
$\begin{matrix} u_{ψ} & = 2 I_{e} \frac{d ({\overset{. .}{x}}_{X} - {\overset{. .}{ζ}}_{X}) (d^{2} + {(X - X_{0} - ζ_{X})}^{2}) - 2 d (X - X_{0} - ζ_{X}) {({\dot{x}}_{X} - {\dot{ζ}}_{X})}^{2}}{{(d^{2} + {(X - X_{0} - ζ_{X})}^{2})}^{2}} - ζ_{ψ} \end{matrix}$ (61a)

$\begin{matrix} u_{ϕ} & = I_{e} \frac{d ({\overset{. .}{x}}_{Y} - {\overset{. .}{ζ}}_{Y}) (d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2}) - 2 d (Y - Y_{0} - ζ_{Y}) {({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}^{2}}{{(d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2})}^{2}} - ζ_{ϕ} \end{matrix}$ (61b)

Note that the previous parametrisation is a pathwise one and that it carries over all of the fluctuations. Let us consider the reference trajectories $x_{X r} (t)$ and $x_{Y r} (t)$ for x and y, depicted in Figure 3. More precisely, the plotted trajectory has the following form:

$\begin{matrix} x_{X r} (t) & = \frac{x_{h} - x_{l}}{2} (1 + tanh (γ (t - t_{r a i s e}))) + x_{l}, t & \in [0, T], \\ T & = 80 s, x_{l} = 1, x_{h} = 4, γ = 4, t_{r a i s e} = 20 s \end{matrix}$ (62)

where the simulation is performed on $[0, 80 s]$ , $γ$ is the stiffness of the trajectory, and $t_{r a i s e}$ is the time where $x_{X r}$ has reached half of the maximum. The minimum is ${lim}_{t \to - \infty} x_{X r} (t) = x_{l} = 1$ ; the maximum is ${lim}_{t \to + \infty} x_{X r} (t) = x_{h} = 4$ .

The open-loop components of the states $v_{ψ r}$ and $v_{ϕ r}$ , corresponding to Equation (60), are shown in Figure 4, together with the influence of the fluctuations $ζ_{X}$ , $ζ_{Y}$ and their first derivatives. These fluctuations were chosen to be

$\begin{matrix} ζ_{*} & = α ζ * \in {x, y, ψ, ϕ} \end{matrix}$ (63)

with ζ being of the form (16), $L = 8$ , $λ = 2$ , and $α = 2 \times 10^{- 6}$ . Recall the form (16):

$\begin{matrix} ζ (t) & = a_{0} + \sum_{j = 1}^{r} [a_{j} cos (\frac{2 π j}{L} t) + b_{j} sin (\frac{2 π j}{L} t)], r = ⌊ L / λ ⌋ \end{matrix}$ (64)

where $λ, L > 0$ , each $a_{j}$ and $b_{j}$ is an independent sample from $N (0, 1 / (2 r + 1))$ (where $N (μ, V)$ denotes the real normal distribution of mean $μ$ and variance V), and $⌊ \cdot ⌋$ is the floor function. Since $L = 8$ and $λ = 2$ , we have $r = 4$ .

The corresponding fluctuations $ζ_{X}$ and $ζ_{Y}$ are depicted in Figure 5.

The open-loop action components $v_{ψ r}$ and $v_{ϕ r}$ , corresponding to Equation (61), are described in Figure 6. Although the fluctuations in Figure 5 are extremely small, their influence is already noticeable, although quite modest, in the $u_{ϕ r}$ plot.

If one considers $α = 10^{- 4}$ , which forces $ζ_{X}$ and $ζ_{Y}$ to be still very small, the corresponding plots of $v_{ψ r}$ and $v_{ϕ r}$ change radically, as depicted in Figure 7.

Consider again Example 4 and its generative model:

$\begin{matrix} {\dot{x}}_{X} = x_{v_{ψ}} \frac{d^{2} + {(x_{X} - X_{0} - ζ_{X})}^{2}}{d} + {\dot{ζ}}_{X} \end{matrix}$ (65a)

$\begin{matrix} {\dot{x}}_{Y} = x_{v_{ϕ}} \frac{d^{2} + {(x_{Y} - Y_{0} - ζ_{Y})}^{2}}{d} + {\dot{ζ}}_{Y} \end{matrix}$ (65b)

$\begin{matrix} 2 I_{e} & {\dot{x}}_{v_{ψ}} = u_{ψ} + ζ_{ψ} \end{matrix}$ (65c)

$\begin{matrix} I_{e} & {\dot{x}}_{v_{ϕ}} = u_{ϕ} + ζ_{ϕ} \end{matrix}$ (65d)

To exactly linearise the model (65), one first differentiates (65c) and (65d), so that the action (control input) $u_{ψ}$ , $u_{ϕ}$ can appear:

$\begin{matrix} {\overset{. .}{x}}_{X} & = \frac{1}{2 d I_{e}} (d^{2} + {(x_{X} - X_{0} - ζ_{X})}^{2}) (u_{ψ} + ζ_{ψ}) + \frac{2 x_{v_{ψ}}}{d} ({\dot{x}}_{X} - {\dot{ζ}}_{X}) + {\overset{. .}{ζ}}_{X} \end{matrix}$ (66a)

$\begin{matrix} {\overset{. .}{x}}_{Y} & = \frac{1}{d I_{e}} (d^{2} + {(x_{Y} - Y_{0} - ζ_{Y})}^{2}) (u_{ϕ} + ζ_{ϕ}) + \frac{2 v_{ϕ}}{d} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y}) + {\overset{. .}{ζ}}_{Y} \end{matrix}$ (66b)

or, in matrix form,

$\begin{matrix} (\begin{matrix} {\overset{. .}{x}}_{X} \\ {\overset{. .}{x}}_{Y} \end{matrix}) & = \frac{1}{2 d I_{e}} (\begin{matrix} δ_{X} & 0 \\ 0 & 2 δ_{Y} \end{matrix}) (\begin{matrix} u_{ψ} + ζ_{ψ} \\ u_{ϕ} + ζ_{ϕ} \end{matrix}) + \frac{2}{d} (\begin{matrix} x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X}) \\ x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y}) \end{matrix}) + (\begin{matrix} {\overset{. .}{ζ}}_{X} \\ {\overset{. .}{ζ}}_{Y} \end{matrix}) \end{matrix}$ (67)

$\begin{matrix} δ_{X} & = d^{2} + {(X - X_{0} - ζ_{X})}^{2} δ_{Y} = d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2} \end{matrix}$ (68)

To exactly linearise the dynamics (65), we shall equate ${({\overset{. .}{x}}_{X}, {\overset{. .}{x}}_{Y})}^{T}$ to ${(v_{X}, v_{Y})}^{T}$ , where $v_{X} (t)$ , $v_{Y} (t)$ are new action functions. Hence, we obtain

$\begin{matrix} \frac{1}{2 d I_{e}} (\begin{matrix} δ_{X} & 0 \\ 0 & 2 δ_{Y} \end{matrix}) (\begin{matrix} u_{ψ} + ζ_{ψ} \\ u_{ϕ} + ζ_{ϕ} \end{matrix}) + (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) & = (\begin{matrix} v_{X} \\ v_{Y} \end{matrix}) \end{matrix}$ (69)

Then, the nonlinear dynamics (65) is transformed into

$\begin{matrix} {\overset{. .}{x}}_{X} & = v_{X} \\ {\overset{. .}{x}}_{Y} & = v_{Y} \end{matrix}$

This graphic is intended to illustrate the set-up of our oculomotion example. On the left, we see the set-up from above, with an eye looking towards a screen at distance d. The pair of equations in the lower left panel give the (linear) flows for the yaw and pitch of the eyeball. The graphic on the right shows the screen with the fixation point P relative to the orthogonal projection $P_{0}$ . The equations in the lower right panel show the corresponding (stochastic) motion of the fixation point. The expression in terms of fixation points provides a useful nonlinear system in which we can unpack the differential flatness concept. It also provides a clear example of a control problem, as we might expect the target of an eye movement to be the fixation location that helps us resolve uncertainty about something in visual space, as opposed to a desired oculomotor angle.

Simple reference trajectory examples $x_{X r} (t), x_{Y r} (t)$ .

Orientation and elevation speeds $v_{ψ r}$ and $v_{ϕ r}$ for very small fluctuations $x_{X r} (t), x_{Y r} (t)$ .

Fluctuations from Equation (63), with an extremely small scale ( $L = 8$ , $λ = 2$ , $α = 2 \times 10^{- 6}$ ). Red line is the mean; grey lines are the signals with fluctuations.

Orientation and elevation torques $u_{ψ r} (t), u_{ϕ r} (t)$ for extremely small fluctuations ( $L = 8$ , $λ = 2$ , $α = 2 \times 10^{- 6}$ ). Red lines are the means, i.e., without fluctuations; grey lines are the signals with fluctuations.

Orientation and elevation torques $u_{ψ r} (t), u_{ϕ r} (t)$ for very small fluctuations ( $L = 8$ , $λ = 2$ , $α = 10^{- 4}$ ). Red lines are the means, i.e., without fluctuations; grey lines are the signals with fluctuations.

4.3. Trajectory Design and Planning

In order to perform tracking of a predefined trajectory, one has first to design this trajectory, the goal of the subsequent action. In some cases, this trajectory will be obvious to design, e.g., for an eye to move along a line, or in a circular motion, or for an arm to grab an object when no obstacle is present. In other circumstances, the task of planning a trajectory may be highly complex, especially in cases where both the agent and some obstacles are moving. The literature on planning is huge, and it has been studied extensively in fields like robotics, where it plays a crucial role (see, e.g., [60]).

4.4. Synthesis Law Computations: Tracking Controller

4.4.1. General Action Tracking Law

There are numerous ways to achieve trajectory tracking with stability, i.e., ensure that the discrepancy in action $ε_{a c t}$ tends to zero asymptotically, and the corresponding literature is huge (see, e.g., [41] for a classic reference). For our purposes, i.e., the fulfilment of the inference guidelines, all of the laws described in [41] are inappropriate. This is because they do not rely on pathwise properties, and more importantly they do not rely on a basis-like property. In contrast, the flatness-based framework is inherently pathwise while embedding the physics of the agent’s model.

Considering a differentially flat model $\dot{x} = f (x, u)$ , we here wish to derive an action law (a controller) able to follow any reference trajectory $t \mapsto ω_{r} (t)$ . In order to compensate for model mismatch and poorly known initial conditions, one has to complement the open loop (obtained through flatness) with a closed-loop corrective term depending on the error $ω (t) - ω_{r} (t)$ .

Knowing that the dynamics is flat, with flat output $ω$ , it can be transformed via the linearising endogenous feedback (50) to a linear dynamics of the form

\begin{matrix} ω_{1}^{(κ_{1})} & = v_{1} \end{matrix}

(70a)

⋮

\begin{matrix} ω_{m}^{(κ_{m})} & = v_{m} \end{matrix}

(70b)

with the new input $v = (v_{1}, \dots, v_{m})$ . Then, the elementary tracking feedback law

\begin{matrix} v_{i} & = ω_{i r}^{(κ_{i})} - \sum_{j = 0}^{κ_{i} - 1} λ_{ω i j} (ω_{i}^{(j)} - ω_{i r}^{(j)}), & i = 1, \dots, m \\ = ω_{i r}^{(κ_{i})} - \sum_{j = 0}^{κ_{i} - 1} λ_{ω i j} e_{ω i}^{(j)}, & with e_{ω i} = ω_{i} - ω_{i r} \end{matrix}

(71)

with appropriately chosen $λ_{z i j}$ gains renders the error dynamics

\begin{matrix} e_{ω i}^{(κ_{i})} + \sum_{j = 0}^{κ_{i} - 1} λ_{ω i j} e_{ω i}^{(j)} & = 0, i = 1, \dots, m \end{matrix}

(72)

asymptotically stable. In vector form, the preceding law (71) is expressed as follows:

\begin{matrix} v_{i} & = ω_{r}^{(κ_{i})} - λ_{ω i} {(e_{ω i}^{〈 κ_{i} - 1 〉})}^{T} \end{matrix}

(73)

where $e_{ω i}^{〈 κ_{i} - 1 〉} = (e_{ω i}, {\dot{e}}_{ω i}, \dots, e_{ω i}^{(κ_{i} - 1)})$ and $λ_{ω i} = (λ_{0 i}, λ_{1 i}, \dots, λ_{(κ_{i} - 1) i})$ . Finally, the tracking action (tracking feedback control) law is expressed as follows:

\begin{matrix} u_{1} & = {\tilde{γ}}_{1} (ω, v, \dot{v}, \dots, v^{(σ_{v 1})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 1})}) \end{matrix}

(74a)

\begin{matrix} u_{2} & = {\tilde{γ}}_{2} (ω, v, \dot{v}, \dots, v^{(σ_{v 2})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 2})}) \end{matrix}

(74b)

⋮

\begin{matrix} u_{m} & = {\tilde{γ}}_{m} (ω, v, \dot{v}, \dots, v^{(σ_{v m})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ m})}) \end{matrix}

(74c)

\begin{matrix} v & = (v_{1}, \dots, v_{m}) \end{matrix}

(74d)

\begin{matrix} v_{i} & = ω_{r}^{(κ_{i})} - λ_{z i} {(e_{z i}^{〈 κ_{i} - 1 〉})}^{T}, i = 1, \dots, m \end{matrix}

(74e)

which ensures the tracking of $ω$ to $ω_{r}$ , with stability through driving the discrepancy $ε_{a c t} = ω - ω_{r}$ to zero.

Remark 7 (Open-loop and model-free).

The preceding action law is named a linearising feedback controller due to the fact that the flat output dynamics is exactly linearised in a first step. Another possibility—most probably more fruitful—is to use one of the following control laws. A first possible choice is an open-loop controller, i.e., an action law obtained through the use of (29b), p. 13, in which ω is substituted with $ω_{r}$ , a reference trajectory, with this open-loop law being supplemented by a model-free controller, in the spirit of [61]. Other possible choices include (74) supplemented by a model-free controller, an ADRC (active disturbance rejection control) one [62,63], or a sliding mode control one [64,65].

Example 5 (Simple generic example; tracking).

Consider again the following simple example with scalar action (control) and sensor output:

$\begin{matrix} {\dot{x}}_{1} & = x_{2} + ζ_{x_{1}} \end{matrix}$ (75a)

$\begin{matrix} {\dot{x}}_{2} & = f (x_{1}, x_{2}) + u + ζ_{x_{2}} \end{matrix}$ (75b)

$\begin{matrix} y & = x_{1} + ζ_{y} \end{matrix}$ (75c)

In order to elaborate on the tracking action laws, we have to obtain the sensor output y dynamics, which is obtained by differentiating (75a), thus obtaining ${\overset{. .}{x}}_{1}$ :

$\begin{matrix} {\overset{. .}{x}}_{1} & = f (x_{1}, {\dot{x}}_{1}) + u + {\dot{ζ}}_{x_{1}} + ζ_{x_{2}} \end{matrix}$ (76)

and using (75c):

$\begin{matrix} \overset{. .}{y} & = f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y}) + u + {\overset{. .}{ζ}}_{y} + {\dot{ζ}}_{x_{1}} + ζ_{x_{2}} \end{matrix}$ (77)

Setting $ϵ_{y} = y - y_{r}$ , we have $y = y_{r} + ϵ_{y}$ . Then, the y-dynamics (77) becomes

$\begin{matrix} {\overset{. .}{ϵ}}_{y} & = - {\overset{. .}{y}}_{r} + f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y}) + u + {\overset{. .}{ζ}}_{y} + {\dot{ζ}}_{x_{1}} + ζ_{x_{2}} \end{matrix}$ (78)

The trajectory tracking goal is

$\begin{matrix} lim_{t \to \infty} ϵ_{y} & = lim_{t \to \infty} (y - y_{r}) = 0 \end{matrix}$ (79)

When expressing this goal in a dynamical setting, this renders $ϵ_{y}$ the solution of a target differential equation:

$\begin{matrix} {\overset{. .}{ϵ}}_{y} + λ_{y, 1} {\dot{ϵ}}_{y} + λ_{y, 2} ϵ_{y} & = 0 \end{matrix}$ (80)

with $λ_{y, 1}, λ_{y, 2} \in R$ such that all solutions of (80) are exponentially decreasing functions of time.

In order to transform (78) into (80), we set

$\begin{matrix} - {\overset{. .}{y}}_{r} + f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y}) + u + {\overset{. .}{ζ}}_{y} + {\dot{ζ}}_{x_{1}} + ζ_{x_{2}} & = - λ_{y, 1} {\dot{ϵ}}_{y} - λ_{y, 2} ϵ_{y} \end{matrix}$ (81)

Hence, the control law ensuring the tracking of $y_{r} (t)$ with stability is given by

$\begin{matrix} u & = {\overset{. .}{y}}_{r} - f (y - ζ_{y}, \dot{y} - {\dot{ζ}}_{y}) - {\overset{. .}{ζ}}_{y} - {\dot{ζ}}_{x_{1}} - ζ_{x_{2}} - λ_{y, 1} {\dot{ϵ}}_{y} - λ_{y, 2} ϵ_{y} \end{matrix}$ (82)

This control law assumes the full knowledge of the fluctuations $ζ_{x_{1}}$ , $ζ_{x_{2}}$ , and $ζ_{y}$ . When this knowledge is not available, we have to derive a control law based on a deterministic generative model (which is all that the agent knows), estimate the fluctuations, and compensate for them (see, e.g., [66] for the so-called model-free control or [62,65] for other schemes).

4.4.2. Oculomotor Example

Example 6 (Oculomotor tracking).

Consider again Example 4 and its generative model:

$\begin{matrix} {\dot{x}}_{X} = x_{v_{ψ}} \frac{d^{2} + {(x_{X} - X_{0} - ζ_{X})}^{2}}{d} + {\dot{ζ}}_{X} \end{matrix}$ (83a)

$\begin{matrix} {\dot{x}}_{Y} = x_{v_{ϕ}} \frac{d^{2} + {(x_{Y} - Y_{0} - ζ_{Y})}^{2}}{d} + {\dot{ζ}}_{Y} \end{matrix}$ (83b)

$\begin{matrix} 2 I_{e} & {\dot{x}}_{v_{ψ}} = u_{ψ} + ζ_{ψ} \end{matrix}$ (83c)

$\begin{matrix} I_{e} & {\dot{x}}_{v_{ϕ}} = u_{ϕ} + ζ_{ϕ} \end{matrix}$ (83d)

and suppose that one wishes to track a reference trajectory $(x_{X r} (t), x_{Y r} (t))$ for $(X, Y)$ . We have seen in Example 4 that the following action feedback:

$\begin{matrix} \frac{1}{2 d I_{e}} & (\begin{matrix} δ_{X} & 0 \\ 0 & 2 δ_{Y} \end{matrix}) (\begin{matrix} u_{ψ} + ζ_{ψ} \\ u_{ϕ} + ζ_{ϕ} \end{matrix}) + (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) = (\begin{matrix} v_{X} \\ v_{Y} \end{matrix}) \end{matrix}$ (84)

$\begin{matrix} δ_{X} = d^{2} + {(X - X_{0} - ζ_{X})}^{2} δ_{Y} = d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2} \end{matrix}$ (85)

transforms the nonlinear dynamics (83) into the following linear dynamics:

$\begin{matrix} {\overset{. .}{x}}_{X} & = v_{X} \\ {\overset{. .}{x}}_{Y} & = v_{Y} \end{matrix}$

The trajectory tracking can then be enforced through the following tracking feedback law in $v_{X}$ , $v_{Y}$ :

$\begin{matrix} v_{X} & = {\overset{. .}{x}}_{X r} - λ_{X 1} {\dot{ε}}_{X} - λ_{X 0} ε_{X} \end{matrix}$ (86a)

$\begin{matrix} v_{Y} & = {\overset{. .}{x}}_{Y r} - λ_{Y 1} {\dot{ε}}_{y} - λ_{Y 0} ε_{y} \end{matrix}$ (86b)

$\begin{matrix} ε_{X} & = x_{X} - x_{X r} ε_{y} = x_{Y} - x_{Y r} \end{matrix}$ (86c)

where $λ_{X 0}$ , $λ_{X 1}$ , $λ_{Y 0}$ , and $λ_{Y 1}$ are suitable constants (the action or feedback control gains) ensuring the stability of the closed-loop error dynamics:

$\begin{matrix} {\overset{. .}{ε}}_{a c t} + λ_{1} {\dot{ε}}_{a c t} + λ_{0} ε_{a c t} = 0 \\ ε_{a c t} = (\begin{matrix} ε_{X} \\ ε_{y} \end{matrix}) λ_{0} = (\begin{matrix} λ_{X 0} \\ λ_{Y 0} \end{matrix}) λ_{1} = (\begin{matrix} λ_{X 1} \\ λ_{Y 1} \end{matrix}) \end{matrix}$

Here, it is sufficient to take all $λ_{*} > 0$ , $* \in {X_{0}, Y_{0}, X_{1}, Y_{1}}$ . Note that the preceding laws $v_{X}$ and $v_{Y}$ in (86) can be replaced by other ones, such as, for instance, model-free control ones (see, e.g., [66]). Finally, to express the original control law $u = (u_{ψ}, u_{ϕ})$ , we must invert (84), yielding $u_{ψ}$ and $u_{ϕ}$ as functions of $v_{X}$ and $v_{Y}$ :

$\begin{matrix} (\begin{matrix} u_{ψ} \\ u_{ϕ} \end{matrix}) & = \frac{d I_{e}}{δ_{X} δ_{Y}} (\begin{matrix} 2 δ_{Y} & 0 \\ 0 & δ_{X} \end{matrix}) [- (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) + (\begin{matrix} v_{X} \\ v_{Y} \end{matrix})] - (\begin{matrix} ζ_{ψ} \\ ζ_{ϕ} \end{matrix}) \end{matrix}$ (87)

Then substituting (86a) into (87) yields the final action tracking controller:

$\begin{matrix} (\begin{matrix} u_{ψ} \\ u_{ϕ} \end{matrix}) & = \frac{d I_{e}}{δ_{X} δ_{Y}} (\begin{matrix} 2 δ_{Y} & 0 \\ 0 & δ_{X} \end{matrix}) [- (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) + (\begin{matrix} {\overset{. .}{x}}_{X r} - λ_{X 1} {\dot{ε}}_{X} - λ_{X 0} ε_{X} \\ {\overset{. .}{x}}_{Y r} - λ_{Y 1} {\dot{ε}}_{y} - λ_{Y 0} ε_{y} \end{matrix})] - (\begin{matrix} ζ_{ψ} \\ ζ_{ϕ} \end{matrix}) \end{matrix}$ (88)

which ensures the tracking of $(x, y)$ to $(x_{X r}, x_{Y r})$ with stability, i.e., the fulfilment of (IG-Perfo) through driving the discrepancy $ε_{a c t} = {(ε_{X}, ε_{y})}^{T}$ to zero. The quantity sensed by the agent is the position $X, Y$ of the point P in the visual scene.

Remark 8 (A more realistic example).

Note that we could have considered a more realistic example than (83) by including, as in [53], a delay $τ_{s}$ in sensing and an advance $τ_{a}$ in acting. In order to achieve predictions, the agent uses a form of open-loop control. A possible choice could be the use of reference signals, using the functional parametrisation offered by the flatness property. In other words, using (61) after substitution of $(x, y)$ with $(x_{X r}, x_{Y r})$ , we get the open-loop action law:

$\begin{matrix} u_{ψ r} & = \frac{1}{I_{e}} \frac{d {\overset{. .}{x}}_{X r} (d^{2} + {(x_{X r} - x_{0})}^{2}) - 2 d (x_{X r} - x_{0}) {\dot{x}}_{X r}^{2}}{{(d^{2} + {(x_{X r} - x_{0})}^{2})}^{2}} \end{matrix}$ (89a)

$\begin{matrix} u_{ϕ r} & = \frac{1}{I_{e}} \frac{d {\overset{. .}{x}}_{Y r} (d^{2} + {(x_{Y r} - y_{0})}^{2}) - 2 d (x_{Y r} - y_{0}) {\dot{x}}_{Y r}^{2}}{{(d^{2} + {(x_{Y r} - y_{0})}^{2})}^{2}} \end{matrix}$ (89b)

Another (heavier) choice would be to simulate the generative model to get a prediction $(X (t + τ_{s} + τ_{a}), Y (t + τ_{s} + τ_{a}))$ for $(X (t), Y (t))$ .

4.4.3. Simulations

The following plots refer to the simulation of the occulomotor model in (83), with the following parameters:

I_{e} = 4.032 \times 10^{- 7} kg . m^{2}, d = 10 m, X_{0} = 0 m, Y_{0} = 0 m

(90)

The fluctuations $ζ_{X}, ζ_{Y}, ζ_{ψ}, ζ_{ϕ}$ are all smooth random functions (i.e., sums of cosine and sine with coefficients that are random variables with a normal distribution). More precisely, they are of the form depicted in Equation (16):

\begin{matrix} ζ_{*} (t) & = α (a_{0} + \sum_{j = 1}^{r} [a_{*, j} cos (\frac{2 π j}{L} t) + b_{*, j} sin (\frac{2 π j}{L} t)]), r = ⌊ L / λ ⌋ \end{matrix}

(91)

\begin{matrix} * \in \{X, Y, ψ, ϕ\} \end{matrix}

(92)

where $λ, L > 0$ , each $a_{*, j}$ and $b_{*, j}$ is an independent sample from $N (0, 1 / (2 r + 1))$ (where $N (μ, V)$ denotes the real normal distribution of mean $μ$ and variance V), and $⌊ \cdot ⌋$ is the floor function. Here, $L = 8, λ = 2, α = 10^{- 5}$ ; hence $r = 4$ . Let us consider a smooth eye tracking of a point along a regular curve, here a quatrefoil, shown in Figure 8 with the reference curve in red and the actual tracking (i.e., the eye movement) in blue. The trajectory type has been chosen because of its smoothness. Its precise time parametrisation is

\begin{matrix} X (t) & = 2 a {sin}^{2} (t) cos (t) \end{matrix}

(93a)

\begin{matrix} Y (t) & = 2 a {cos}^{2} (t) sin (t) \end{matrix}

(93b)

\begin{matrix} a & = 2 \end{matrix}

(93c)

Reference quatrefoil trajectory in red and actual movement from the generative model in blue.

The corresponding references $x_{X r} (t)$ , $x_{Y r} (t)$ and the actual values $x_{X} (t)$ , $x_{Y} (t)$ are plotted in Figure 9.

Plots of $x_{X r} (t)$ , $x_{Y r} (t)$ in red and of $x_{X} (t)$ , $x_{Y} (t)$ in blue for the quatrefoil.

The corresponding action tracking laws $u_{ψ}$ and $u_{ϕ}$ are shown in Figure 10. As a last example, let us consider the hypocycloid in Figure 11. We chose this type of trajectory because it may be seen as typical of saccadic eye movements. Its precise time parametrisation is

\begin{matrix} X (t) & = (a - b) cos (t) + b cos (\frac{(a - b) t}{b}) \end{matrix}

(94a)

\begin{matrix} Y (t) & = (a - b) sin (t) - b sin (\frac{(a - b) t}{b}) \end{matrix}

(94b)

\begin{matrix} a & = 2, b = \frac{6}{5} \end{matrix}

(94c)

Plots of the actions $u_{ψ}$ and $u_{ϕ}$ for the quatrefoil.

Reference hypocycloid trajectory in red and actual movement from the generative model in blue.

4.5. Active Inference

The principle of active inference can then be stated as extremisation of free energy via both action and perception, under a prior belief that action will extremise expected free energy (see, e.g., Figure 7 of [2]):

\begin{matrix} μ_{x}^{*} & = {argmin}_{μ_{x}} F (μ_{x}, μ_{u}, y (u)) \end{matrix}

(95a)

\begin{matrix} u^{*} & = {argmin}_{u} F (μ_{x}^{*}, μ_{u}, y (u)) \end{matrix}

(95b)

\begin{matrix} F (x, y) & = E_{q, x} (\ln \frac{q (x ∣ u)}{p (x ∣ u)}) - E_{q, x} (\ln p (y ∣ x, u)) \end{matrix}

(95c)

\begin{matrix} μ_{u} & = {argmin}_{u} G (u) \end{matrix}

(95d)

\begin{matrix} G (u) & = E_{q, y} (\ln \frac{q (y ∣ u)}{p (y ∣ y_{r})}) - E_{q, x, y} (\ln p (y ∣ x, u)) \end{matrix}

(95e)

with $μ_{x}$ being the agent’s estimate of the hidden state $x$ , and $F, G$ as in (21c) (complexity/accuracy) and (22b) (risk/ambiguity), written in a compact form.

\begin{matrix} E_{q, ξ} (ϕ) & = \int q (ξ (t) ∣ u (t)) ϕ (t) d ξ (t) \\ E_{q, ξ, χ} (ϕ) & = \int \int q (ξ (t), χ (t) ∣ u (t)) ϕ (t) d ξ (t) d χ (t) \end{matrix}

The above extremisations can be formulated as the solutions to a gradient descent.

4.6. Link with Flatness-Based Tracking

The imperative for active inference is the minimisation of surprise, namely, the discrepancy between expectations, or beliefs, and actual values. In the light of the previous decomposition of variational free energy in Section 3.1, we conclude by foregrounding the links between active inference and nonlinear control. Consider the agent’s dynamics $\dot{x} = f (x, u) + ζ_{x}$ . The dynamic feedback (see (48))

\begin{matrix} u & = φ (x, ξ, ζ_{x}, {\dot{ζ}}_{x}, \dots, ζ_{x}^{(σ_{u})}, v) \end{matrix}

(96a)

\begin{matrix} \dot{ξ} & = ψ (x, ξ, ζ_{x}, {\dot{ζ}}_{x}, \dots, ζ_{x}^{(σ_{ξ})}, v) \end{matrix}

(96b)

\begin{matrix} v & = Γ (ε_{a c t}, {\dot{ε}}_{a c t}, \dots, ε_{a c t}^{(σ_{v})}) \end{matrix}

(96c)

(where $v$ is the new input) transforms the agent’s dynamics to

\begin{matrix} \dot{x} & = f (x, φ (x, ξ, ζ_{x}, {\dot{ζ}}_{x}, \dots, ζ^{(σ_{u})}, v)) \end{matrix}

(97a)

\begin{matrix} \dot{ξ} & = ψ (x, ξ, ζ_{x}, {\dot{ζ}}_{x}, \dots, ζ_{x}^{(σ_{ξ})}, v) \end{matrix}

(97b)

\begin{matrix} v & = Γ (ε_{a c t}, {\dot{ε}}_{a c t}, \dots, ε_{a c t}^{(σ_{v})}) \end{matrix}

(97c)

The above Equation (97) then yield the following action surprise dynamics, or action discrepancy dynamics:

\begin{matrix} Ξ (ε_{a c t}, {\dot{ε}}_{a c t}, \dots, ε_{a c t}^{(σ_{ε_{a c t}})}) & = 0 \end{matrix}

(98)

This surprise dynamics will not, in general (i.e., for any feedback law (96)), imply the action discrepancy’s limit to be zero, i.e., a minimisation of the free energy. Let us now consider the tracking action (tracking feedback control) law stemming from (74):

\begin{matrix} u_{1} & = {\tilde{γ}}_{1} (ω, v, \dot{v}, \dots, v^{(σ_{v 1})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 1})}) \end{matrix}

(99a)

\begin{matrix} u_{2} & = {\tilde{γ}}_{2} (ω, v, \dot{v}, \dots, v^{(σ_{v 2})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ 2})}) \\ ⋮ \end{matrix}

(99b)

\begin{matrix} u_{m} & = {\tilde{γ}}_{m} (ω, v, \dot{v}, \dots, v^{(σ_{v m})}, ζ, \dot{ζ}, \dots, ζ^{({\tilde{σ}}_{ζ m})}) \end{matrix}

(99c)

\begin{matrix} v & = (v_{1}, \dots, v_{m}) \end{matrix}

(99d)

\begin{matrix} v_{i} & = ω_{r}^{(κ_{i})} - λ_{z i} {(e_{z i}^{〈 κ_{i} - 1 〉})}^{T}, i = 1, \dots, m \end{matrix}

(99e)

which yields a special form of (98):

\begin{matrix} ε_{a c t}^{(n)} - Λ (0, ε_{a c t}, {\dot{ε}}_{a c t}, \dots ε_{a c t}^{(n - 1)}) & = 0 \end{matrix}

(100)

where $Λ$ is a linear function of its arguments such that (100) admits solely exponentially stable solutions. The tracking action (tracking feedback control) law (99) thus ensures the tracking of $ω$ to $ω_{r}$ , with stability through driving the discrepancy $ε_{a c t} = ω - ω_{r}$ to zero, minimising the risk in the expected free energy $G$ .

Example 7 (Oculomotor tracking control law).

Consider again the oculomotor Example 4. The tracking action (tracking feedback control) law (99) takes the form of (88):

$\begin{matrix} (\begin{matrix} u_{ψ} \\ u_{ϕ} \end{matrix}) & = \frac{d I_{e}}{δ_{X} δ_{Y}} (\begin{matrix} 2 δ_{Y} & 0 \\ 0 & δ_{X} \end{matrix}) [- (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) + (\begin{matrix} {\overset{. .}{x}}_{X r} - λ_{X 1} {\dot{ε}}_{X} - λ_{X 0} ε_{X} \\ {\overset{. .}{x}}_{Y r} - λ_{Y 1} {\dot{ε}}_{y} - λ_{Y 0} ε_{y} \end{matrix})] - (\begin{matrix} ζ_{ψ} \\ ζ_{ϕ} \end{matrix}) \end{matrix}$ (101)

5. Prediction as a Link Between Active Inference and Differential Flatness

5.1. Delays and $δ$ -Flatness

In real systems stemming from neuroscience or physiology, delays are present, both in sensing and acting. This has not yet been taken into account in the previous sections, although it remains of utmost importance. Indeed, there are fundamental differences between a delay-free model and one including delays; the most striking difference is the infinite-dimensional character of delay systems. To be more precise, consider a delay-differential equation of the following form:

\begin{matrix} x (t) & = f (x (t - τ)) \end{matrix}

(102)

If we want to integrate such an equation, we do not need to know a pointwise initial condition such as $x (0)$ but have to know a whole function on a time interval $x : [- τ, 0) \to R$ . Another view on this is that there is not just one operator acting on the variables—time differentiation—but two: time differentiation and the time delay. Therefore, the structural properties (controllability and observability) of such systems become more complex, even for linear delay systems (see, e.g., [67]). The notion of flatness has been extended to the case of delay systems (see [68,69]). We shall informally recall the definition of the so-called $δ$ -flatness (a special case of the $π$ -flatness where the delays do not only occur on the sensor outputs and the action variables but also on the hidden state) and then expose two different kinds of predictors, which are needed for tracking. We denote a delay operator of amplitude $τ$ by $δ_{τ}$ :

\begin{matrix} (δ_{τ} x) (t) & = x (t - τ) \end{matrix}

(103)

In informal terms, a $δ$ -flat system is a system of delay differential equations that is differentially flat when one allows for delays. Let us see how this notion unfolds in a concrete example.

Example 8.

Consider Example 2 with a delay in the action:

$\begin{matrix} {\dot{x}}_{1} (t) & = x_{2} (t) + ζ_{x_{1}} (t) \end{matrix}$ (104a)

$\begin{matrix} {\dot{x}}_{2} (t) & = f (x_{1} (t), x_{2} (t)) + u (t - τ) + ζ_{x_{2}} (t) \end{matrix}$ (104b)

$\begin{matrix} y & = x_{1} (t) + ζ_{y} (t) \end{matrix}$ (104c)

with $τ > 0, τ \in R$ . This model is easily seen to be δ-flat, with y as a δ-flat output. Indeed, we have

$\begin{matrix} x_{1} (t) & = y (t) - ζ_{y} (t) \end{matrix}$ (105a)

$\begin{matrix} x_{2} (t) & = \dot{y} (t) - {\dot{ζ}}_{y} (t) - ζ_{x_{1}} (t) \\ u (t) & = \overset{. .}{y} (t + τ) - f (y (t + τ) - ζ_{y} (t + τ), \dot{y} (t + τ) - {\dot{ζ}}_{y} (t + τ) - ζ_{x_{1}} (t + τ)) - \end{matrix}$ (105b)

$\begin{matrix} {\overset{. .}{ζ}}_{y} (t + τ) - {\dot{ζ}}_{x_{1}} (t + τ) - ζ_{x_{2}} (t + τ) \end{matrix}$ (105c)

5.2. Trajectory Tracking and Predictors

When one wishes to perform trajectory tracking, as in (99), the presence of delays in the sensor output and/or in the action control will induce the necessity to predict some or all of the hidden states or sensor output. Consider this under the previous example.

Example 9.

The tracking control scheme (82) is now transformed into

$\begin{matrix} u (t) & = {\overset{. .}{y}}_{r} (t + τ) - f (y (t) - z_{y} (t), \dot{y} (t) - {\dot{z}}_{y} (t)) - \end{matrix}$

$\begin{matrix} {\overset{. .}{z}}_{y} (t) - {\dot{z}}_{x_{1}} (t) - z_{x_{1}} (t) - λ_{y, 1} {\dot{e}}_{y} (t) - λ_{y, 2} e_{y} (t) \end{matrix}$ (106a)

$\begin{matrix} y (t) = y (t + τ), & e_{y} (t) = ϵ_{y} (t + τ) \end{matrix}$ (106b)

$\begin{matrix} z_{y} (t) & = ζ_{y} (t + τ), z_{x_{1}} (t) = ζ_{x_{1}} (t + τ), z_{x_{2}} (t) = ζ_{x_{2}} (t + τ) \end{matrix}$ (106c)

We now review two kinds of predictors, which may be used in tracking control schemes. Consider the following type of mean generative model with delayed action and a linear relation between the sensory output and hidden state:

\begin{matrix} \dot{x} (t) & = f (x (t), u (t - τ)) \end{matrix}

(107a)

\begin{matrix} y (t) & = C x (t) \end{matrix}

(107b)

with $C \in R^{n \times p}$ . This model can be transformed into the advanced generative model

\begin{matrix} \dot{x} (t) & = f (x (t), u (t)) \end{matrix}

(108a)

\begin{matrix} y (t) & = C x (t) \end{matrix}

(108b)

\begin{matrix} x (t) & = x (t + τ), y (t) = y (t + τ) \end{matrix}

(108c)

The first scheme is a so-called delayed observer (see [70]):

\begin{matrix} \dot{\hat{x}} (t) & = f (\hat{x} (t), u (t)) + L (x (t) - \hat{x} (t - τ)) \end{matrix}

(109)

with $L \in R^{n \times n}$ being the predictor gain matrix. The simulation of Equation (109) yields an estimate $\hat{x} (t)$ of $x (t + τ)$ . It is proven [70] that the so-called prediction observer error $x (t) - \hat{x} (t - τ)$ tends to zero as t tends to infinity.

The second scheme is an integral form one (see [71,72,73]):

\begin{matrix} u (t) & = κ (x (t)) \end{matrix}

(110a)

\begin{matrix} x (t) & = \int_{t - τ}^{t} f (x (ξ), u (ξ)) d ξ + x (t) \end{matrix}

(110b)

where $κ$ is such that $\dot{x} = f (x, κ (x))$ is globally asymptotically stable at $x = 0$ ; i.e., for every trajectory $x (t)$ , we have $x (t) \to 0$ as $t \to \infty$ . Equation (110b) comes from the following simple observation:

\begin{matrix} \int_{t}^{t + τ} \dot{z} (ξ) d ξ & = z (t + τ) - z (t) \end{matrix}

(111)

The initial condition for the integral Equation (110b) for $x (t)$ is defined by

\begin{matrix} x (σ) & = \int_{- τ}^{σ} f (x (ξ), u (ξ)) d ξ + x (0), σ \in [- τ, 0] \end{matrix}

(112)

The predictor state $x (t)$ is given by the implicit relation (110b), which can be solved using various approximation strategies for the integral on the right-hand side. Then, the predictive control law of the simple example shall be implemented as follows.

Example 10.

The tracking control scheme (106a) is now transformed into

$\begin{matrix} u (t) & = {\overset{. .}{y}}_{r} (t + τ) - f (y (t) - z_{y} (t), \dot{y} (t) - {\dot{z}}_{y} (t)) - \end{matrix}$

$\begin{matrix} {\overset{. .}{z}}_{y} (t) - {\dot{z}}_{x_{1}} (t) - z_{x_{1}} (t) - λ_{y, 1} {\dot{e}}_{y} (t) - λ_{y, 2} e_{y} (t) \end{matrix}$ (113a)

$\begin{matrix} y (t) & = \int_{t - τ}^{t} f (y (ξ), u (ξ)) d ξ + y (t), e_{y} (t) = ϵ_{y} (t + τ) \end{matrix}$ (113b)

$\begin{matrix} z_{y} (t) & = ζ_{y} (t + τ), z_{x_{1}} (t) = ζ_{x_{1}} (t + τ), z_{x_{2}} (t) = ζ_{x_{2}} (t + τ) \end{matrix}$ (113c)

5.3. Generalised Coordinates

In the active inference framework, there is a crucial distinction between what is called motion of the mean compared to mean of the motion. To unpack this distinction we first need the notion of generalised coordinates, which is obtained through a linearised differentiation. Generally a differentiation operation ∂ in algebra is defined as one such that, for any variables (here time functions) $ω_{1}$ and $ω_{2}$ , the chain rule is fulfilled:

\begin{matrix} \partial (ω_{1} ω_{2}) & = \partial ω_{1} ω_{2} + ω_{1} \partial ω_{2} \end{matrix}

(114)

A linearised (or first-order) differentiation, $d_{1}$ , here denoted as ′, is an operator following the above chain rule such that, for any time functions $z (t)$ and $ζ (t)$ ,

\begin{matrix} z^{i} ζ^{(j)} & = 0 \end{matrix}

(115)

whenever $j > 0$ , $i + j > 1$ and denoting by $ζ^{(j)}$ the jth iterated application of the ′ operator to $ζ$ . This type of differentiation is linked to first-order stochastic realisation problems, as briefly discussed below.

Higher-order differentiations can also be considered. A k-th-order differentiation $d_{k}$ is an operator following the above chain rule such that, for any time functions $z (t)$ and $ζ (t)$ ,

\begin{matrix} z^{i} d_{k}^{j} ζ^{(j)} & = 0 \end{matrix}

(116)

whenever $j > 0$ , $i + j > k$ . This type of differentiation is linked to higher-order stochastic realisation problems, although these may induce quite involved computations.

In other words, suppose the solution $ξ (t)$ of

\begin{matrix} \dot{ξ} (t) & = F (ξ (t)) + η (t) \end{matrix}

(117)

can be expressed through a series expansion:

\begin{matrix} ξ (t) & = \sum_{i = 0}^{\infty} \frac{ξ (0)}{i!} t^{i} \end{matrix}

(118)

e.g., it could be an analytic function of time or a Gevrey series (i.e., belonging to a class of functions in between the analytic and the full $C^{\infty}$ one—see, e.g., [74,75]; see also [76] for extensions). Then, the various derivatives at the time origin can be recovered as follows:

\begin{matrix} \dot{ξ} (0) & = F (ξ (0)) + η (0) \end{matrix}

(119a)

\begin{matrix} \overset{. .}{ξ} (0) & = \frac{d}{d t} F (ξ (0)) + \dot{η} (0) \end{matrix}

(119b)

\begin{matrix} \overset{. . .}{ξ} (0) & = \frac{d^{2}}{d t^{2}} F (ξ (0)) + \overset{. .}{η} (0) \\ ⋮ \end{matrix}

(119c)

assuming the fluctuations $η (t)$ are sufficiently smooth and

\begin{matrix} \frac{d}{d t} F (ξ (0)) & = \nabla F (ξ (0)) \dot{ξ} (0) \end{matrix}

(120a)

\begin{matrix} \frac{d^{2}}{d t^{2}} F (ξ (0)) & = \nabla F (ξ (0)) \overset{. .}{ξ} (0) + {\dot{ξ}}^{T} (0) \nabla^{2} F (ξ (0)) \dot{ξ} (0) \\ ⋮ \end{matrix}

(120b)

The so-called local linear approximation, in terms of [32], amounts to neglecting all higher-order terms in Equation (120), i.e., considering

\begin{matrix} \frac{d^{k}}{d t^{k}} F (ξ (0)) & = \nabla F (ξ (0)) ξ^{(k)} (0) \end{matrix}

(121)

Such an approximation, when valid, enables one to solve the so-called stochastic realisation problem quite easily (recall that a realisation is a differential equation involving a hidden state obtained from an input/action–output/sensor differential equation—see, e.g., Definition 6; see, e.g., [77,78] on the stochastic realisation problem). This approximation is also justified when studying generalised Bayesian filtering under the Laplace approximation (see, e.g., [32], Subsections 3.3.4 and 3.3.5).

The generalised coordinates may be seen as a coordinate frame that moves with the current point. In this view, it is linked with one-form transformations one can apply to nonlinear systems (see, e.g., [79,80]); the latter are more general than the endogenous dynamical feedbacks used here. It is also linked with the Cartan moving frame method (see, e.g., [81]).

6. Conclusions, Limitations, and Future Directions

We have considered the utility of differential flatness through the lens of active inference (see, e.g., [1]). This utility has been detailed in terms of control as inference. Specifically, one might conclude that if the generative models that underwrite active inference or control as inference can be limited to the class of differentially flat models, we obtain an extremely efficient control-theoretic scheme. This work therefore enables a control-theoretic perspective on active inference as control as inference by focusing on action trajectories and the minimisation of various discrepancies (afforded by variational and expected free energy). From the perspective of active inference, this work is a primer on differential flatness and its particular relevance to the kinds of generative models one might consider and be committed to. Crucially, this paper is the first (provisional) attempt to consider expected free energy in the setting of continuous state-space models.

In addition to their roles in developing systems for control, one could consider other applications of the frameworks outlined in this paper. One area is the field of computational psychiatry (see, e.g., [82,83,84]), where one can develop generative models of decision-making tasks—solved with active inference—and use these to understand the computational mechanisms of psychopathology. Such models are often formulated in terms of the selection among discrete alternatives. However, as noted by our reviewers, differential flatness pertains to continuous, differentiable state spaces. While it is true that many applications of computational psychiatry focus on discrete probabilities, there are important areas of psychiatry that depend upon continuous variables of the sort addressed in differential flatness accounts. These include the altered motor dynamics associated with catatonia in psychotic disorders (see, e.g., [85]) and altered smooth pursuit eye movements in schizophrenia (see, e.g., [86,87]).

There are several related notions that could not be addressed here due to the lack of space. These include Liouvillian aspects (see [88,89,90] related to automatic control and [91,92] for the examination of this property in the setting of hypothalamic–pituitary–adrenal axis models and Wilson–Cowan population networks), and robust tracking with model-free control (see [66]). Liouvillian aspects, in particular, offer an opportunity to extend the notion of flatness when a model is not differentially flat. Future work will consider these and other issues related to energy transmission and controllability (see, e.g., [93,94] for a formulation of the corresponding problem) and variational function transmission. These deal with characterising a model in terms of salient features and understanding how reparameterisations might transform such features.

We discuss the following points, emphasising limitations of the current treatment and sketching some future directions.

Generalised coordinates, where some limitations—due to its approximate character—may be avoided through smooth random realisations.
Extensions of flatness: Liouvillian characters to deal with cases where the model is not differentially flat.
Observers and algebraic estimators to estimate the hidden state from sensor output.
Robust control law synthesis to cope with uncertainty under the generative model, including fluctuations.
Constraint fulfilment, where constraints are imposed on the hidden state, the action, and their time derivatives.
Feature transmission: How a feature of interest; e.g., the energy ( $L^{2}$ norm), the slope or curvature, etc., is transformed through functional parametrisation.

6.1. Generalised Coordinate Limitations

The generalised coordinates are not appropriate for deriving tracking feedback action in the sense being considered here. Indeed, the dynamical extension algorithm needs to make the action appear through time differentiation, which will not be the case unless the action was already present in the state-space dynamics. More precisely, for the components of the flat output whose Brunovksỳ index $κ_{i}$ is strictly superior to 2, the generalised coordinates are not appropriate, since successive application of $d_{1}$ will not make the action appear. For the components whose index $κ_{i} = 1$ or 2, the generalised coordinates will be sufficient.

Let us illustrate this through three simple examples.

First consider the longitudinal motion of a car, used, for instance, in ACCs (Automatic Cruise Controllers):
$\begin{matrix} M {\dot{V}}_{x} & = - F (V_{x}) + \frac{1}{r} u \end{matrix}$ (122)
with M being the car’s mass, $V_{x} (t)$ the longitudinal speed of the vehicle’s centre of gravity, $- F (V_{x})$ the force due to the wind and to friction of the tyres on the ground, r the mean wheel radius, and u the traction engine propulsion torque, taken as the action. This system is trivially flat, with flat output $V_{x}$ , and the preceding equation is rewritten as
$\begin{matrix} M V_{x}^{'} & = - F (V_{x}, {\dot{V}}_{x}) + \frac{1}{r} u \end{matrix}$

The action is already present in the dynamics equation, $κ = 1$ , and there is no need to differentiate more. The generalised coordinate dynamics will lead to the same action tracking controller as the original one.
Second, consider the oculomotor example again. Then one supplementary differentiation of $x_{X}$ (resp. $x_{Y}$ ) will be sufficient. Both of these will involve $f (x, u)$ and its partial derivatives with respect to $x$ and $u$ in the flat output dynamics (see (69)):
$\begin{matrix} (\begin{matrix} x_{X} \\ x_{Y} \end{matrix}) & = \frac{1}{2 d I_{e}} (\begin{matrix} δ_{X} & 0 \\ 0 & 2 δ_{Y} \end{matrix}) (\begin{matrix} u_{ψ} + ζ_{ψ} \\ u_{ϕ} + ζ_{ϕ} \end{matrix}) + (\begin{matrix} \frac{2 x_{v_{ψ}} ({\dot{x}}_{X} - {\dot{ζ}}_{X})}{d} + {\overset{. .}{ζ}}_{X} \\ \frac{2 x_{v_{ϕ}} ({\dot{x}}_{Y} - {\dot{ζ}}_{Y})}{d} + {\overset{. .}{ζ}}_{Y} \end{matrix}) = (\begin{matrix} v_{X} \\ v_{Y} \end{matrix}) \end{matrix}$ (123a)

$\begin{matrix} δ_{X} & = d^{2} + {(X - X_{0} - ζ_{X})}^{2} δ_{Y} = d^{2} + {(Y - Y_{0} - ζ_{Y})}^{2} \end{matrix}$ (123b)
Third, consider one of the most popular model for diabetes, namely the Bergman minimal model (see, e.g., [95,96]):
$\begin{matrix} \dot{G} & = - k_{1} (G - G_{b}) - X G + D = f_{G} (G, X, D) \end{matrix}$ (124a)

$\begin{matrix} \dot{X} & = - k_{2} X + k_{3} (I - I_{b}) = f_{X} (X, I) \end{matrix}$ (124b)

$\begin{matrix} \dot{I} & = - k_{4} (I - I_{b}) + u = f_{I} (I, u) \end{matrix}$ (124c)

$\begin{matrix} D (t) & = \frac{1}{2} (1 + tanh (γ (t - t_{b o l}))) B e^{- d (t - t_{b o l})} \end{matrix}$ (124d)
where $G (t)$ is the concentration of blood glucose; $X (t)$ is the concentration of insulin in the tissue fluid; $I (t)$ is the concentration of insulin in the blood; $G_{b}$ and $I_{b}$ are the basal concentrations of glucose and insulin; and $D (t)$ is the glycaemic influence of a meal, seen as a fluctuation. In addition, $k_{1}$ , $k_{2}$ , and $k_{3}$ are positive-valued parameters that control the rates of appearance and disappearance of glucose and insulin; $t_{b o l}$ is the meal intake time (bolus intake).

We wish to control the glucose concentration $G (t)$ in order to track a desired trajectory $G_{r} (t)$ known in advance. To do so, we apply the dynamical extension algorithm, and we have to differentiate Equation (124a) two times in order for the action u to appear. We get
$\begin{matrix} \overset{. .}{G} & = - (k_{1} + X) \dot{G} - G \dot{X} + \dot{D} \end{matrix}$
and
$\begin{matrix} \overset{. . .}{G} & = - 2 \dot{G} \dot{X} - (k_{1} + X) \overset{. .}{G} - G \overset{. .}{X} + \overset{. .}{D} \end{matrix}$
and the control will appear in $\dot{I}$ , present in $\overset{. .}{X}$ . The generalised coordinates unfold as follows. The preceding model is rewritten, within the differentiation $d_{1}$ , as follows:
$\begin{matrix} G^{'} & = - k_{1} (G - G_{b}) - X G + D = f_{G} (G, X, D) \end{matrix}$ (125a)

$\begin{matrix} X^{'} & = - k_{2} X + k_{3} (I - I_{b}) = f_{X} (X, I) \end{matrix}$ (125b)

$\begin{matrix} I^{'} & = - k_{4} (I - I_{b}) + u = f_{I} (I, u) \end{matrix}$ (125c)

$\begin{matrix} D (t) & = \frac{1}{2} (1 + tanh (γ (t - t_{b o l}))) B e^{- d (t - t_{b o l})} \end{matrix}$ (125d)

The successive application of $d_{1}$ yields
$\begin{matrix} G^{'} & = \partial_{G} f_{G} G^{'} + \partial_{X} f_{G} X^{'} + \partial_{D} f_{G} D^{'} = - (k_{1} + X) G^{'} - G X^{'} + D^{'} \end{matrix}$
and
$\begin{matrix} G^{″} & = - (k_{1} + X) G^{″} - G X^{″} + D^{″} \end{matrix}$

Thus, the term $- 2 X^{'} G^{'}$ is missing when compared to its counterpart with time differentiation. And, if one attempted to use this in order to produce an action tracking feedback, the resulting error dynamics would be of the form
$\begin{matrix} ε_{G}^{'''} & = - λ_{G 2} ε_{G}^{″} - λ_{G 1} ε_{G}^{'} - λ_{G 0} ε_{G} - 2 X^{'} G^{'} \end{matrix}$ (126)
with $ε_{G} = G - G_{r}$ and $G_{r} (t)$ being a reference glucose trajectory. Although the $λ_{G i}$ are such that the solutions of
$\begin{matrix} ε_{G}^{'''} + λ_{G 2} ε_{G}^{''} + λ_{G 1} ε_{G}^{'} + λ_{G 0} ε_{G} & = 0 \end{matrix}$
are exponentially decreasing, the solutions of (126) may not tend to zero, since it is steered by $- 2 X^{'} G^{'}$ .

6.2. Smooth Random Realisation

The use of this type of differentiation is also linked to the stochastic realisation problem (see, e.g., [10], 4.(c).(i), p. 15), which may be quite complex in the general case. In contrast to this, the differential flatness property yields a weak Brunovský canonical form through the flat output (see [97], Subsection 4.1 and Definition 4.3). This canonical form yields the so-called flat output dynamics, which readily gives a smooth random realisation, as the following proposition states.

Proposition 4 (Smooth random realisation).

Consider a differentially flat model, with flat output $ω = (ω_{1}, \dots, ω_{m})$ . Then, there exists integers $κ_{1}, \dots, κ_{m}$ such that $(ω_{1}, {\dot{ω}}_{1}, \dots, ω_{1}^{(κ_{1} - 1)}, ω_{2},$ $\dots,$ $ω_{m}^{(κ_{m} - 1)})$ is a state. This state, through the flat output dynamics, yields a smooth random realisation of the model.

Proof.

The existence of the integers $κ_{1}, \dots, κ_{m}$ such that $(ω_{1},$ ${\dot{ω}}_{1},$ $\dots ω_{1}^{(κ_{1} - 1)}, ω_{2}, \dots, ω_{m}^{(κ_{m} - 1)})$ is a state is ensured by the dynamical extension algorithm. To see this, and to obtain the associated realisation, recall the so-called flat output dynamics (Equation (53) above), which can be written as the following state-space representation:

$\begin{matrix} {\dot{x}}_{1} & = x_{2} \end{matrix}$ (127a)

$⋮$

$\begin{matrix} {\dot{x}}_{κ_{1} - 1} & = x_{κ_{1}} \end{matrix}$ (127b)

$\begin{matrix} {\dot{x}}_{κ_{1}} & = γ_{1} (x, u, \dot{u}, \dots, u^{(σ_{u 1})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 1})}) \end{matrix}$ (127c)

$\begin{matrix} {\dot{x}}_{κ_{1} + 1} & = x_{κ_{1} + 2} \end{matrix}$ (127d)

$⋮$

$\begin{matrix} {\dot{x}}_{κ_{1} + κ_{2} - 1} & = x_{κ_{1} + κ_{2}} \end{matrix}$ (127e)

$\begin{matrix} {\dot{x}}_{κ_{1} + κ_{2}} & = γ_{2} (x, u, \dot{u}, \dots, u^{(σ_{u 1})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ 2})}) \end{matrix}$ (127f)

$⋮$

$\begin{matrix} {\dot{x}}_{n - 1} & = x_{n} \end{matrix}$ (127g)

$\begin{matrix} {\dot{x}}_{n} & = γ_{m} (x, u, \dot{u}, \dots, u^{(σ_{u m})}, ζ, \dot{ζ}, \dots, ζ^{(σ_{ζ m})}) \end{matrix}$ (127h)

with $x = (x_{1}, \dots, x_{n})$ , since $κ_{1} + \dots + κ_{m} = n$ , and

$\begin{matrix} \begin{matrix} x_{1} & = & ω_{1}, & x_{2} & = & {\dot{ω}}_{1}, \dots, & x_{κ_{1}} & = & ω_{1}^{(κ_{1} - 1)} \\ x_{κ_{1} + 1} & = & ω_{2}, & x_{κ_{1} + 2} & = & {\dot{ω}}_{2}, \dots, & x_{κ_{1} + κ_{2}} & = & z^{(κ_{2} - 1)} \\ ⋮ \\ x_{κ_{1} + \dots + κ_{m - 1} + 1} & = & ω_{m}, & \dots & \dots, & x_{n} & = & ω_{m}^{(κ_{m} - 1)} . \end{matrix} \end{matrix}$

Then $x$ is a state that yields a smooth random realisation of the original system, based on the differential flatness property. □

The present framework can be seen as an adequate proposal for the future direction in Subsection 4.1,, paragraph “Stochastic control via generalized coordinates” of [32]. It also embeds, in a rather simple fashion, non-stationary smooth random signals (see Remark 4.1.1 in [32]).

6.3. Other Future Directions

We now briefly consider future directions that could not be unpacked in this paper due to lack of space.

6.3.1. Extensions of Flatness: Liouvillian Characters

Although many practical system models are differentially flat, some, among specific classes, are not; this is especially the case for many biological and neuroscience population models. Fortunately, another analogous property is shared by a much wider class, the one of so-called Liouvillian systems. Liouvillian systems can be seen as an extension of flat systems [88,89,98]. The most striking property of the latter is that all state and control variables of the system can be directly expressed—without integration of any differential equation—in terms of the flat output and a finite number of its time derivatives. So-called Liouvillian systems share a similar property, but in order to derive the trajectories of a Liouvillian system, we also need integration of a few differential equations whose solutions are known analytically. It follows that flatness-based control approaches can be extended up to solving a finite number of differential equations.

6.3.2. Observers and Algebraic Estimators

We have here dealt with control law synthesis, but we did not touch on the equally important subject of observer or estimator synthesis, i.e., the procedures aimed at estimating the hidden state from sensor measurements. Two main paths are available: The first—of so-called observers—amounts to a simulation of the mean generative model, driven by the error $y - g (x)$ between the sensor output and its observation model, a function of the state (see, e.g., [99,100]). The second, more direct, approach is to directly make use of the so-called constructive observability (see, e.g., [4,101]), where the state is a function of sensor output, action, and their derivatives: $x = ψ_{x} (y, u, \dot{y}, \dot{u}, \dots, y^{(ρ_{y})}, u^{(ρ_{u})})$ . This last procedure requires the estimation of sensor output derivatives (see [6,102]).

6.3.3. Robust Control Law Synthesis

When the mean generative model is a crude approximation to the real system, the cumulative effect of the fluctuations needs to be compensated for in the control scheme. So-called robust control laws aim to fulfil a control goal, such as trajectory tracking, despite the fluctuations, or perturbations, and unmodelled dynamics (i.e., model mismatch). These can be achieved via the so-called model-free control scheme, where the cumulative effects of fluctuations are estimated online and compensated for on the fly (see, e.g., [66]). The recent HEOL scheme (see [103]) achieves the same goal through slightly different techniques. While model-free control is often associated with flatness-based feedback tracking control laws synthesised on a nominal, or mean generative, model, HEOL one uses an open-loop flatness-based scheme and the tangent, or variational system, associated with the simplified flat system, i.e., the linearised system around a reference trajectory of the simplified flat system. Other instances include, among others, sliding mode control and active disturbance rejection control (see, e.g., [62,64,65]).

6.3.4. Constraint Fulfilment

In real-world applications, dynamical systems are always subject to constraints: on the state (for example the configuration space of a robot is not the whole space) and on the action (e.g., muscles have finite power). These can be handled in the present framework through optimisation-based planning of the flat output trajectories (see, e.g., [104,105,106]). A promising framework is the one of model-free predictive control (MFPC, see [107]), mixing the popular predictive control (see, e.g., [108,109]) and model-free control, cited above.

6.3.5. Feature Transmission

Another interesting characteristic—of the functional parametrisation associated with differential flatness—is one of feature transmission. This includes the transmission of geometric features: how the curvature of the flat output trajectory is related to the curvature of the action; in other words, deducing the action’s curvature as a function of the flat output curvature and its time derivatives from the relation yielding the action as a function of the flat output and its derivatives.

Supplementary Materials

The following supporting information can be downloaded at https://github.com/hugues-mounier/AI-FP-differential-flatness-and-smooth-random-realisation/.

Author Contributions

Methodology, H.M., T.P. and K.F.; Software, H.M.; Writing—original draft, H.M.; Writing—review & editing, T.P. and K.F. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement

The data and code presented in this study are openly available in https://github.com/hugues-mounier/AI-FP-differential-flatness-and-smooth-random-realisation/ (accessed on 29 October 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This research was funded by Wellcome Trust grant number 226793/Z/22/Z. The Third author (TR) is supported by an NIHR Academic Clinical Fellowship [ref: ACF-2023-13-013].

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

1.Friston K. A free energy principle for a particular physics. arXiv. 2019 doi: 10.48550/arXiv.1906.10184.1906.10184 [DOI] [Google Scholar]
2.Friston K., Da Costa L., Sajid N., Heins C., Ueltzhöffer K., Pavliotis G.A., Parr T. The free energy principle made simpler but not too simple. Phys. Rep. 2023;1024:1–29. doi: 10.1016/j.physrep.2023.07.001. [DOI] [Google Scholar]
3.Ramstead M.J.D., Sakthivadivel D.A.R., Heins C., Koudahl M., Millidge B., Da Costa L., Klein B., Friston K.J. On Bayesian mechanics: A physics of and by beliefs. Interface Focus. 2023;13:20220029. doi: 10.1098/rsfs.2022.0029. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Fliess M., Glad S.T. Essays on Control. Birkhäuser; Boston, TX, USA: 1993. An Algebraic Approach to Linear and Nonlinear Control; pp. 223–267. [DOI] [Google Scholar]
5.Fliess M., Lévine J., Martin P., Rouchon P. Flatness and defect of non-linear systems: Introductory theory and examples. Int. J. Control. 1995;61:1327–1361. doi: 10.1080/00207179508921959. [DOI] [Google Scholar]
6.Fliess M., Join C., Ramirez H.S. Non-linear estimation is easy. Int. J. Model. Identif. Control. 2008;4:12. doi: 10.1504/IJMIC.2008.020996. [DOI] [Google Scholar]
7.Baltieri M. PhD thesis. University of Sussex; Brighton, UK: 2019. Active Inference: Building a New Bridge Between Control Theory and Embodied Cognitive Science. [Google Scholar]
8.Baltieri M., Buckley C. PID Control as a Process of Active Inference with Linear Generative Models. Entropy. 2019;21:257. doi: 10.3390/e21030257. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Lanillos P., Meo C., Pezzato C., Meera A.A., Baioumy M., Ohata W., Tschantz A., Millidge B., Wisse M., Buckley C.L., et al. Active Inference in Robotics and Artificial Agents: Survey and Challenges. arXiv. 2021 doi: 10.48550/arXiv.2112.01871.2112.01871 [DOI] [Google Scholar]
10.Da Costa L., Friston K., Heins C., Pavliotis G.A. Bayesian mechanics for stationary processes. Proc. R. Soc. A Math. Phys. Eng. Sci. 2021;477:20210518. doi: 10.1098/rspa.2021.0518. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Meera A.A., Wisse M. A Brain Inspired Learning Algorithm for the Perception of a Quadrotor in Wind. arXiv. 2021 doi: 10.48550/arXiv.2109.11971.2109.11971 [DOI] [Google Scholar]
12.Tschantz A., Barca L., Maisto D., Buckley C.L., Seth A.K., Pezzulo G. Simulating homeostatic, allostatic and goal-directed forms of interoceptive control using Active Inference. Biol. Psychol. 2022;169:108266. doi: 10.1016/j.biopsycho.2022.108266. [DOI] [PubMed] [Google Scholar]
13.Baioumy M., Pezzato C., Ferrari R., Hawes N. Unbiased Active Inference for Classical Control. arXiv. 2022 doi: 10.48550/arXiv.2207.13409.2207.13409 [DOI] [Google Scholar]
14.Bos F., Meera A.A., Benders D., Wisse M. Free Energy Principle for State and Input Estimation of a Quadcopter Flying in Wind; Proceedings of the 2022 International Conference on Robotics and Automation (ICRA); Philadelphia, PA, USA. 23–27 May 2022; Piscataway, NJ, USA: IEEE; 2022. [DOI] [Google Scholar]
15.Rudolph J. Flatness-Based Control—An Introduction. 1st ed. Shaker GmbH; Düren, Germany: 2021. [DOI] [Google Scholar]
16.Francine Diener M.D., editor. Nonstandard Analysis in Practice. Springer; Berlin/Heidelberg, Germany: 1995. [DOI] [Google Scholar]
17.Fliess M. Probabilités et fluctuations quantiques. Comptes Rendus. MathéMatique. 2007;344:663–668. doi: 10.1016/j.crma.2007.04.001. [DOI] [Google Scholar]
18.Fliess M., Join C. Towards a new viewpoint on causality for time series. ESAIM Proc. Surv. 2015;49:37–52. doi: 10.1051/proc/201549004. [DOI] [Google Scholar]
19.Han X., Kloeden P.E. Random Ordinary Differential Equations and Their Numerical Solution. Springer; Singapore: 2017. [DOI] [Google Scholar]
20.Friz P.K., Hairer M. A Course on Rough Paths: With an Introduction to Regularity Structures. Springer International Publishing; Cham, Switzerland: 2020. [DOI] [Google Scholar]
21.Kahane J.P. Some Random Series of Functions Series. 2nd ed. Cambridge University Press; Cambridge, UK: 1993. Number 5 in Cambridge Studies in Advanced Mathematics. [Google Scholar]
22.Hill M. Convergence of random Fourier series; Proceedings of the University of Chicago 2012 Summer Program for Undergraduate; 2012. [(accessed on 29 October 2025)]. Available online: https://math.uchicago.edu/~may/REU2012/REUPapers/Hill.pdf. [Google Scholar]
23.Tzvetkov N. Riemannian analogue of a Paley-Zygmund theorem. Séminaire Équations aux Dérivées Partielles (2008–2009) [(accessed on 29 October 2025)]. pp. 1–14. Available online: https://eudml.org/doc/11189.
24.Marcus M.B., Pisier G. Random Fourier Series with Applications to Harmonic Analysis (AM-101) Princeton University Press; Princeton, NJ, USA: 1981. [Google Scholar]
25.Filip S., Javeed A., Trefethen L.N. Smooth Random Functions, Random ODEs, and Gaussian Processes. SIAM Rev. 2019;61:185–205. doi: 10.1137/17M1161853. [DOI] [Google Scholar]
26.Robert J., Adler J.E.T. Random Fields and Geometry. Springer; New York, NY, USA: 2007. [DOI] [Google Scholar]
27.Adler R.J., Taylor J.E. Topological Complexity of Smooth Random Functions: École d’Été de Probabilités de Saint-Flour XXXIX-2009. Springer; Berlin/Heidelberg, Germany: 2011. [DOI] [Google Scholar]
28.Adler R.J., Taylor J.E., Worsley K.J. Applications of Random Fields and Geometry: Foundations and Case Studies. Book in Progress. 2016. [(accessed on 29 October 2025)]. Available online: https://cris.technion.ac.il/en/publications/applications-of-random-fields-and-geometry-foundations-and-case-s/
29.Friston K.J., Holmes A.P., Worsley K.J., Poline J., Frith C.D., Frackowiak R.S.J. Statistical parametric maps in functional imaging: A general linear approach. Hum. Brain Mapp. 1994;2:189–210. doi: 10.1002/hbm.460020402. [DOI] [Google Scholar]
30.Worsley K.J., Marrett S., Neelin P., Vandal A.C., Friston K.J., Evans A.C. A unified statistical approach for determining significant signals in images of cerebral activation. Hum. Brain Mapp. 1996;4:58–73. doi: 10.1002/(SICI)1097-0193(1996)4:1<58::AID-HBM4>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]
31.Gyöngy I., Michaletzky G. On Wong–Zakai approximations with δ–martingales. Proc. R. Soc. London. Ser. A Math. Phys. Eng. Sci. 2004;460:309–324. doi: 10.1098/rspa.2003.1244. [DOI] [Google Scholar]
32.Da Costa L., Da Costa N., Heins C., Medrano J., Pavliotis G.A., Parr T., Meera A.A., Friston K. A Theory of Generalized Coordinates for Stochastic Differential Equations. Stud. Appl. Math. 2025;154:e70062. doi: 10.1111/sapm.70062. [DOI] [Google Scholar]
33.Stratonovich R.L. Topics in the Theory of Random Noise. Volume 2 Macmillan Education; South Yarra, Australia: 1967. [Google Scholar]
34.Esser C., Jaffard S., Vedel B. Regularity properties of random wavelet series. arXiv. 2023 doi: 10.1090/tpms/1205.2304.00811 [DOI] [Google Scholar]
35.Aubry J.M., Jaffard S. Emergent Nature. World Scientific; Singapore: 2002. Random wavelet series: Theory and applications. [DOI] [Google Scholar]
36.Adler R.J. On excursion sets, tube formulas and maxima of random fields. Ann. Appl. Probab. 2000;10:1–74. doi: 10.1214/aoap/1019737664. [DOI] [Google Scholar]
37.Krishnan S.R., Taylor J.E., Adler R.J. The Intrinsic geometry of some random manifolds. Electron. Commun. Probab. 2017;22:1–12. doi: 10.1214/16-ecp4763. [DOI] [Google Scholar]
38.Yuan R.S., Ma Y.A., Yuan B., Ao P. Lyapunov function as potential function: A dynamical equivalence. Chin. Phys. B. 2014;23:010505. doi: 10.1088/1674-1056/23/1/010505. [DOI] [Google Scholar]
39.Parr T., Da Costa L., Friston K. Markov blankets, information geometry and stochastic thermodynamics. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2019;378:20190159. doi: 10.1098/rsta.2019.0159. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Wang H., Hu W., Gan X., Ao P. The generalized Lyapunov function as Ao’s potential function: Existence in dimensions 1 and 2. J. Appl. Anal. Comput. 2023;13:359–375. doi: 10.11948/20220149. [DOI] [Google Scholar]
41.Khalil H.K. Nonlinear Control. Pearson; Upper Saddle River, NJ, USA: 2014. [Google Scholar]
42.Fliess M. A note on the invertibility of nonlinear input-output differential systems. Syst. Control Lett. 1986;8:147–151. doi: 10.1016/0167-6911(86)90073-3. [DOI] [Google Scholar]
43.Fliess M., Lévine J., Martin P., Rouchon P. On Differentially Flat Nonlinear Systems. IFAC Proc. Vol. 1992;25:159–163. doi: 10.1016/s1474-6670(17)52275-2. [DOI] [Google Scholar]
44.Fliess M., Lévine J., Martin P., Rouchon P. A Lie-Bäcklund approach equivalence and flatness of nonlinear systems. IEEE Trans. Automat. Control. 1999;44:922–937. doi: 10.1109/9.763209. [DOI] [Google Scholar]
45.Sira-Ramírez H. Differentially Flat Systems. CRC Press; Boca Raton, FL, USA: 2004. [Google Scholar]
46.Hilbert D. Über den Begriff der Klasse von Differentialgleichungen. Math. Ann. 1912;73:95–108. doi: 10.1007/BF01456663. [DOI] [Google Scholar]
47.Cartan M. Sur l’équivalence absolue de certains systèmes d’équations différentielles et sur certaines familles de courbes. Bull. Soc. Math. Fr. 1914;2:12–48. doi: 10.24033/bsmf.938. [DOI] [Google Scholar]
48.Fuller S., Greiner B., Moore J., Murray R., van Paassen R., Yorke R. The Python Control Systems Library (python-control); Proceedings of the 60th IEEE Conference on Decision and Control (CDC); Austin, TX, USA. 14–17 December 2021; Piscataway, NJ, USA: IEEE; 2021. pp. 4875–4881. [Google Scholar]
49.Perrin D. Algebraic Geometry. Springer; London, UK: 2008. [DOI] [Google Scholar]
50.Eisenbud D., Harris J. The Geometry of Schemes. 1st ed. Springer; New York, NY, USA: 2000. Graduate Texts in Mathematics. [Google Scholar]
51.Agrios M. A Very Elementary Introduction to Sheaves. arXiv. 2022 doi: 10.48550/arXiv.2202.01379.2202.01379 [DOI] [Google Scholar]
52.Curry J. Sheaves, Cosheaves and Applications. arXiv. 20131303.3255 [Google Scholar]
53.Perrinet L.U., Adams R.A., Friston K.J. Active inference, eye movements and oculomotor delays. Biol. Cybern. 2014;108:777–801. doi: 10.1007/s00422-014-0620-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Adams R.A., Aponte E., Marshall L., Friston K.J. Active inference and oculomotor pursuit: The dynamic causal modelling of eye movements. J. Neurosci. Methods. 2015;242:1–14. doi: 10.1016/j.jneumeth.2015.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Parr T., Friston K.J. Active inference and the anatomy of oculomotion. Neuropsychologia. 2018;111:334–343. doi: 10.1016/j.neuropsychologia.2018.01.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Parr T., Friston K.J. The computational pharmacology of oculomotion. Psychopharmacology. 2019;236:2473–2484. doi: 10.1007/s00213-019-05240-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Harris D., Vine S., Wilson M., Arthur T. The Relationship Between Environmental Statistics and Predictive Gaze Behaviour During a Manual Interception Task: Eye Movements as Active Inference. Comput. Brain Behav. 2023;7:225–241. doi: 10.1007/s42113-023-00190-5. [DOI] [Google Scholar]
58.Wong A. Listing’s law: Clinical significance and implications for neural control. Surv. Ophthalmol. 2004;49:563–575. doi: 10.1016/S0039-6257(04)00134-1. [DOI] [PubMed] [Google Scholar]
59.Ekstrand B. Equations of motion for a two-axes gimbal system. IEEE Trans. Aerosp. Electron. Syst. 2001;37:1083–1091. doi: 10.1109/7.953259. [DOI] [Google Scholar]
60.Siciliano B., Khatib O., editors. Springer Handbook of Robotics. Springer International Publishing; Cham, Switzerland: 2016. [DOI] [Google Scholar]
61.Fliess M., Join C., Moussa K., Djouadi S.M., Alsager M.W. Toward simple in silico experiments for drugs administration in some cancer treatments. IFAC-PapersOnLine. 2021;54:245–250. doi: 10.1016/j.ifacol.2021.10.263. [DOI] [Google Scholar]
62.Sira-Ramirez H., Luviano-Juárez A., Ramírez-Neria M., Zurita-Bustamante E.W. Active Disturbance Rejection Control of Dynamic Systems: A Flatness Based Approach. Butterworth-Heinemann; Oxford, UK: 2017. [Google Scholar]
63.Sira-Ramírez H., Gómez-León B.C., Aguilar-Orduña M.A. A two stage Active Disturbance Rejection Control design for under-actuated nonlinear systems. IFAC-PapersOnLine. 2023;56:4539–4544. doi: 10.1016/j.ifacol.2023.10.949. [DOI] [Google Scholar]
64.Utkin V.I. Sliding Modes in Control and Optimization. Springer; Berlin/Heidelberg, Germany: 1992. [DOI] [Google Scholar]
65.Shtessel Y., Edwards C., Fridman L., Levant A. Sliding Mode Control and Observation. Springer; New York, NY, USA: 2014. [DOI] [Google Scholar]
66.Fliess M., Join C. Model-free control. Int. J. Control. 2013;86:2228–2252. doi: 10.1080/00207179.2013.810345. [DOI] [Google Scholar]
67.Fliess M., Mounier H. Controllability and observability of linear delay systems: An algebraic approach. ESAIM Control. Optim. Calc. Var. 1998;3:301–314. doi: 10.1051/cocv:1998111. [DOI] [Google Scholar]
68.Mounier H., Rudolph J. Flatness-based control of nonlinear delay systems: A chemical reactor example. Int. J. Control. 1998;71:871–890. doi: 10.1080/002071798221614. [DOI] [Google Scholar]
69.Mounier H., Rudolph J. Flatness and quasi-static state feedback in non-linear delay systems. Int. J. Control. 2008;81:445–456. doi: 10.1080/00207170701579437. [DOI] [Google Scholar]
70.Estrada-Sánchez I., Velasco-Villa M., Rodríguez-Cortés H. Prediction-Based Control for Nonlinear Systems with Input Delay. Math. Probl. Eng. 2017;2017:7415418. doi: 10.1155/2017/7415418. [DOI] [Google Scholar]
71.Krstic M. Input Delay Compensation for Forward Complete and Strict-Feedforward Nonlinear Systems. IEEE Trans. Autom. Control. 2010;55:287–303. doi: 10.1109/TAC.2009.2034923. [DOI] [Google Scholar]
72.Bresch-Pietri D., Petit N., Krstic M. Prediction-based control for nonlinear state- and input-delay systems with the aim of delay-robustness analysis; Proceedings of the 2015 54th IEEE Conference on Decision and Control (CDC); Osaka, Japan. 15–18 December 2015; Piscataway, NJ, USA: IEEE; 2015. pp. 6403–6409. [DOI] [Google Scholar]
73.Karafyllis I., Krstic M. Predictor Feedback for Delay Systems: Implementations and Approximations. Springer International Publishing; Cham, Switzerland: 2017. [DOI] [Google Scholar]
74.Rodino . Linear Partial Differential Operators in Gevrey Spaces. World Scientific; Singapore: 1993. pp. 5–59. [DOI] [Google Scholar]
75.Balser W. Formal Power Series and Linear Systems of Meromorphic Ordinary Differential Equations. Springer; New York, NY, USA: 2000. [DOI] [Google Scholar]
76.Teofanov N., Tomić F., Žigić M. An introduction to extended Gevrey regularity. arXiv. 2024 doi: 10.3390/axioms13060352.2404.17366 [DOI] [Google Scholar]
77.Lindquist A., Picci G. On the Stochastic Realization Problem. SIAM J. Control Optim. 1979;17:365–389. doi: 10.1137/0317028. [DOI] [Google Scholar]
78.Veeravalli T., Raginsky M. Revisiting Stochastic Realization Theory using Functional Itô Calculus. arXiv. 2024 doi: 10.1016/j.ifacol.2024.10.190.2402.10157 [DOI] [Google Scholar]
79.van Nieuwstadt M., Rathinam M., Murray R. Differential flatness and absolute equivalence; Proceedings of the 1994 33rd IEEE Conference on Decision and Control; Orlando FL, USA. 14–16 December, 1994; Piscataway, NJ, USA: IEEE; 1994. CDC-94. [DOI] [Google Scholar]
80.Olver P.J. Equivalence, Invariants and Symmetry. Cambridge University Press; Cambridge, UK: 1995. [DOI] [Google Scholar]
81.Ivey T.A.t.A. Cartan for Beginners. American Mathematical Society; Providence, RI, USA: 2003. Graduate Studies in Mathematics. [Google Scholar]
82.Pio-Lopez L., Kuchling F., Tung A., Pezzulo G., Levin M. Active inference, morphogenesis, and computational psychiatry. Front. Comput. Neurosci. 2022;16:988977. doi: 10.3389/fncom.2022.988977. [DOI] [PMC free article] [PubMed] [Google Scholar]
83.Badcock P.B., Davey C.G. Active Inference in Psychology and Psychiatry: Progress to Date? Entropy. 2024;26:833. doi: 10.3390/e26100833. [DOI] [PMC free article] [PubMed] [Google Scholar]
84.Harris H.W. Active inference and psychodynamics: A novel integration with applications to depression and stress disorders. Front. Psychiatry. 2025;16:1630858. doi: 10.3389/fpsyt.2025.1630858. [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Adams R.A., Stephan K.E., Brown H.R., Frith C.D., Friston K.J. The Computational Anatomy of Psychosis. Front. Psychiatry. 2013;4:47. doi: 10.3389/fpsyt.2013.00047. [DOI] [PMC free article] [PubMed] [Google Scholar]
86.Adams R.A., Perrinet L.U., Friston K. Smooth Pursuit and Visual Occlusion: Active Inference and Oculomotor Control in Schizophrenia. PLoS ONE. 2012;7:e47502. doi: 10.1371/journal.pone.0047502. [DOI] [PMC free article] [PubMed] [Google Scholar]
87.Perrinet L.U., Adams R.A., Friston K. Active inference, eye movements and oculomotor delays. BMC Neurosci. 2013;14:P133. doi: 10.1186/1471-2202-14-S1-P133. [DOI] [PMC free article] [PubMed] [Google Scholar]
88.Chelouah A. Extensions of differential flat fields and Liouvillian systems; Proceedings of the 36th IEEE Conference on Decision and Control; San Diego, CA, USA. 12 December 1997; Piscataway, NJ, USA: IEEE; 1997. pp. 4268–4273. CDC-97. [DOI] [Google Scholar]
89.Chelouah A. Diffieties and Liouvillian Systems. arXiv. 2010 doi: 10.48550/arXiv.1010.3909.1010.3909 [DOI] [Google Scholar]
90.Chetverikov V.N. Liouville systems and symmetries. Differ. Equ. 2012;48:1639–1651. doi: 10.1134/S0012266112120099. [DOI] [Google Scholar]
91.Nicolau F., Mounier H., Androulakis I.P. HPA axis differential flatness and Liouvillian study for higher resiliency investigations. IMA J. Math. Control Inf. 2023;40:746–788. doi: 10.1093/imamci/dnad030. [DOI] [Google Scholar]
92.Nicolau F., Mounier H. Flatness of Networks of Synaptically Coupled Excitatory-Inhibitory Neural Modules. ESAIM Control. Optim. Calc. Var. 2023;29:89. doi: 10.1051/cocv/2023082. [DOI] [Google Scholar]
93.Karrer T.M., Kim J.Z., Stiso J., Kahn A.E., Pasqualetti F., Habel U., Bassett D.S. A practical guide to methodological considerations in the controllability of structural brain networks. J. Neural Eng. 2020;17:026031. doi: 10.1088/1741-2552/ab6e8b. [DOI] [PMC free article] [PubMed] [Google Scholar]
94.Baggio G., Pasqualetti F., Zampieri S. Energy-Aware Controllability of Complex Networks. Annu. Rev. Control. Robot. Auton. Syst. 2022;5:465–489. doi: 10.1146/annurev-control-042920-014957. [DOI] [Google Scholar]
95.Nandi S., Singh T. Global Sensitivity Analysis on the Bergman Minimal Model. IFAC-PapersOnLine. 2020;53:16112–16118. doi: 10.1016/j.ifacol.2020.12.431. [DOI] [Google Scholar]
96.Bergman R.N. Origins and History of the Minimal Model of Glucose Regulation. Front. Endocrinol. 2021;11:583016. doi: 10.3389/fendo.2020.583016. [DOI] [PMC free article] [PubMed] [Google Scholar]
97.Rudolph J. Well-formed dynamics under quasi-static state feedback. Banach Cent. Publ. 1995;32:349–360. doi: 10.4064/-32-1-349-360. [DOI] [Google Scholar]
98.Crespo T., Hajto Z., Mohseni R. Real Liouvillian extensions of partial differential fields. Symmetry Integr. Geom. Methods Appl. 2021;17:095. doi: 10.3842/SIGMA.2021.095. [DOI] [Google Scholar]
99.Besançon G. Nonlinear Observers and Applications. Springer; Berlin/Heidelberg, Germany: 2007. [DOI] [Google Scholar]
100.Isidori A. Nonlinear Control Theory for Automation. In: Nof S.Y., editor. Springer Handbook of Automation. Springer International Publishing; Cham, Switzerland: 2023. pp. 163–187. [DOI] [Google Scholar]
101.Diop S. Some Control Observation Problems and Their Differential Algebraic Partial Solutions. In: Quadrat A., Zerz E., editors. Algebraic and Symbolic Computation Methods in Dynamical Systems. Springer International Publishing; Cham, Switzerland: 2020. pp. 147–160. [DOI] [Google Scholar]
102.Sira-Ramirez H.J., Garcia Rodriguez C., Cortes Romero J.A., Luviano Juarez A. Algebraic Identification and Estimation Methods in Feedback Control Systems. John Wiley & Sons; Nashville, TN, USA: 2014. Wiley Series in Dynamics and Control of Electromechanical Systems. [Google Scholar]
103.Join C., Delaleau E., Fliess M. Flatness-based control revisited: The HEOL setting. Comptes Rendus. Mathématique. 2024;362:1693–1706. doi: 10.5802/crmath.674. [DOI] [Google Scholar]
104.Greco L., Mounier H., Bekcheva M. An approximate characterisation of the set of feasible trajectories for constrained flat systems. Automatica. 2022;144:110484. doi: 10.1016/j.automatica.2022.110484. [DOI] [Google Scholar]
105.Beaver L.E., Malikopoulos A.A. Optimal control of differentially flat systems is surprisingly easy. Automatica. 2024;159:111404. doi: 10.1016/j.automatica.2023.111404. [DOI] [Google Scholar]
106.Join C., Delaleau E., Fliess M. The Euler-Lagrange Equation and Optimal Control: Preliminary Results; Proceedings of the 2024 12th International Conference on Systems and Control (ICSC); Batna, Algeria. 3–5 November 2024; Piscataway, NJ, USA: IEEE; 2024. pp. 155–160. [DOI] [Google Scholar]
107.Join C., Delaleau E., Fliess M. Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs. IFAC-PapersOnLine. 2025;59:255–260. doi: 10.1016/j.ifacol.2025.10.044. [DOI] [Google Scholar]
108.Richalet J., Rault A., Testud J., Papon J. Model predictive heuristic control: Applications to industrial processes. Automatica. 1978;14:413–428. doi: 10.1016/0005-1098(78)90001-8. [DOI] [Google Scholar]
109.Mayne D.Q. Model predictive control: Recent developments and future promise. Automatica. 2014;50:2967–2986. doi: 10.1016/j.automatica.2014.10.128. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data and code presented in this study are openly available in https://github.com/hugues-mounier/AI-FP-differential-flatness-and-smooth-random-realisation/ (accessed on 29 October 2025).

[B1-entropy-28-00087] 1.Friston K. A free energy principle for a particular physics. arXiv. 2019 doi: 10.48550/arXiv.1906.10184.1906.10184 [DOI] [Google Scholar]

[B2-entropy-28-00087] 2.Friston K., Da Costa L., Sajid N., Heins C., Ueltzhöffer K., Pavliotis G.A., Parr T. The free energy principle made simpler but not too simple. Phys. Rep. 2023;1024:1–29. doi: 10.1016/j.physrep.2023.07.001. [DOI] [Google Scholar]

[B3-entropy-28-00087] 3.Ramstead M.J.D., Sakthivadivel D.A.R., Heins C., Koudahl M., Millidge B., Da Costa L., Klein B., Friston K.J. On Bayesian mechanics: A physics of and by beliefs. Interface Focus. 2023;13:20220029. doi: 10.1098/rsfs.2022.0029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4-entropy-28-00087] 4.Fliess M., Glad S.T. Essays on Control. Birkhäuser; Boston, TX, USA: 1993. An Algebraic Approach to Linear and Nonlinear Control; pp. 223–267. [DOI] [Google Scholar]

[B5-entropy-28-00087] 5.Fliess M., Lévine J., Martin P., Rouchon P. Flatness and defect of non-linear systems: Introductory theory and examples. Int. J. Control. 1995;61:1327–1361. doi: 10.1080/00207179508921959. [DOI] [Google Scholar]

[B6-entropy-28-00087] 6.Fliess M., Join C., Ramirez H.S. Non-linear estimation is easy. Int. J. Model. Identif. Control. 2008;4:12. doi: 10.1504/IJMIC.2008.020996. [DOI] [Google Scholar]

[B7-entropy-28-00087] 7.Baltieri M. PhD thesis. University of Sussex; Brighton, UK: 2019. Active Inference: Building a New Bridge Between Control Theory and Embodied Cognitive Science. [Google Scholar]

[B8-entropy-28-00087] 8.Baltieri M., Buckley C. PID Control as a Process of Active Inference with Linear Generative Models. Entropy. 2019;21:257. doi: 10.3390/e21030257. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9-entropy-28-00087] 9.Lanillos P., Meo C., Pezzato C., Meera A.A., Baioumy M., Ohata W., Tschantz A., Millidge B., Wisse M., Buckley C.L., et al. Active Inference in Robotics and Artificial Agents: Survey and Challenges. arXiv. 2021 doi: 10.48550/arXiv.2112.01871.2112.01871 [DOI] [Google Scholar]

[B10-entropy-28-00087] 10.Da Costa L., Friston K., Heins C., Pavliotis G.A. Bayesian mechanics for stationary processes. Proc. R. Soc. A Math. Phys. Eng. Sci. 2021;477:20210518. doi: 10.1098/rspa.2021.0518. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11-entropy-28-00087] 11.Meera A.A., Wisse M. A Brain Inspired Learning Algorithm for the Perception of a Quadrotor in Wind. arXiv. 2021 doi: 10.48550/arXiv.2109.11971.2109.11971 [DOI] [Google Scholar]

[B12-entropy-28-00087] 12.Tschantz A., Barca L., Maisto D., Buckley C.L., Seth A.K., Pezzulo G. Simulating homeostatic, allostatic and goal-directed forms of interoceptive control using Active Inference. Biol. Psychol. 2022;169:108266. doi: 10.1016/j.biopsycho.2022.108266. [DOI] [PubMed] [Google Scholar]

[B13-entropy-28-00087] 13.Baioumy M., Pezzato C., Ferrari R., Hawes N. Unbiased Active Inference for Classical Control. arXiv. 2022 doi: 10.48550/arXiv.2207.13409.2207.13409 [DOI] [Google Scholar]

[B14-entropy-28-00087] 14.Bos F., Meera A.A., Benders D., Wisse M. Free Energy Principle for State and Input Estimation of a Quadcopter Flying in Wind; Proceedings of the 2022 International Conference on Robotics and Automation (ICRA); Philadelphia, PA, USA. 23–27 May 2022; Piscataway, NJ, USA: IEEE; 2022. [DOI] [Google Scholar]

[B15-entropy-28-00087] 15.Rudolph J. Flatness-Based Control—An Introduction. 1st ed. Shaker GmbH; Düren, Germany: 2021. [DOI] [Google Scholar]

[B16-entropy-28-00087] 16.Francine Diener M.D., editor. Nonstandard Analysis in Practice. Springer; Berlin/Heidelberg, Germany: 1995. [DOI] [Google Scholar]

[B17-entropy-28-00087] 17.Fliess M. Probabilités et fluctuations quantiques. Comptes Rendus. MathéMatique. 2007;344:663–668. doi: 10.1016/j.crma.2007.04.001. [DOI] [Google Scholar]

[B18-entropy-28-00087] 18.Fliess M., Join C. Towards a new viewpoint on causality for time series. ESAIM Proc. Surv. 2015;49:37–52. doi: 10.1051/proc/201549004. [DOI] [Google Scholar]

[B19-entropy-28-00087] 19.Han X., Kloeden P.E. Random Ordinary Differential Equations and Their Numerical Solution. Springer; Singapore: 2017. [DOI] [Google Scholar]

[B20-entropy-28-00087] 20.Friz P.K., Hairer M. A Course on Rough Paths: With an Introduction to Regularity Structures. Springer International Publishing; Cham, Switzerland: 2020. [DOI] [Google Scholar]

[B21-entropy-28-00087] 21.Kahane J.P. Some Random Series of Functions Series. 2nd ed. Cambridge University Press; Cambridge, UK: 1993. Number 5 in Cambridge Studies in Advanced Mathematics. [Google Scholar]

[B22-entropy-28-00087] 22.Hill M. Convergence of random Fourier series; Proceedings of the University of Chicago 2012 Summer Program for Undergraduate; 2012. [(accessed on 29 October 2025)]. Available online: https://math.uchicago.edu/~may/REU2012/REUPapers/Hill.pdf. [Google Scholar]

[B23-entropy-28-00087] 23.Tzvetkov N. Riemannian analogue of a Paley-Zygmund theorem. Séminaire Équations aux Dérivées Partielles (2008–2009) [(accessed on 29 October 2025)]. pp. 1–14. Available online: https://eudml.org/doc/11189.

[B24-entropy-28-00087] 24.Marcus M.B., Pisier G. Random Fourier Series with Applications to Harmonic Analysis (AM-101) Princeton University Press; Princeton, NJ, USA: 1981. [Google Scholar]

[B25-entropy-28-00087] 25.Filip S., Javeed A., Trefethen L.N. Smooth Random Functions, Random ODEs, and Gaussian Processes. SIAM Rev. 2019;61:185–205. doi: 10.1137/17M1161853. [DOI] [Google Scholar]

[B26-entropy-28-00087] 26.Robert J., Adler J.E.T. Random Fields and Geometry. Springer; New York, NY, USA: 2007. [DOI] [Google Scholar]

[B27-entropy-28-00087] 27.Adler R.J., Taylor J.E. Topological Complexity of Smooth Random Functions: École d’Été de Probabilités de Saint-Flour XXXIX-2009. Springer; Berlin/Heidelberg, Germany: 2011. [DOI] [Google Scholar]

[B28-entropy-28-00087] 28.Adler R.J., Taylor J.E., Worsley K.J. Applications of Random Fields and Geometry: Foundations and Case Studies. Book in Progress. 2016. [(accessed on 29 October 2025)]. Available online: https://cris.technion.ac.il/en/publications/applications-of-random-fields-and-geometry-foundations-and-case-s/

[B29-entropy-28-00087] 29.Friston K.J., Holmes A.P., Worsley K.J., Poline J., Frith C.D., Frackowiak R.S.J. Statistical parametric maps in functional imaging: A general linear approach. Hum. Brain Mapp. 1994;2:189–210. doi: 10.1002/hbm.460020402. [DOI] [Google Scholar]

[B30-entropy-28-00087] 30.Worsley K.J., Marrett S., Neelin P., Vandal A.C., Friston K.J., Evans A.C. A unified statistical approach for determining significant signals in images of cerebral activation. Hum. Brain Mapp. 1996;4:58–73. doi: 10.1002/(SICI)1097-0193(1996)4:1<58::AID-HBM4>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]

[B31-entropy-28-00087] 31.Gyöngy I., Michaletzky G. On Wong–Zakai approximations with δ–martingales. Proc. R. Soc. London. Ser. A Math. Phys. Eng. Sci. 2004;460:309–324. doi: 10.1098/rspa.2003.1244. [DOI] [Google Scholar]

[B32-entropy-28-00087] 32.Da Costa L., Da Costa N., Heins C., Medrano J., Pavliotis G.A., Parr T., Meera A.A., Friston K. A Theory of Generalized Coordinates for Stochastic Differential Equations. Stud. Appl. Math. 2025;154:e70062. doi: 10.1111/sapm.70062. [DOI] [Google Scholar]

[B33-entropy-28-00087] 33.Stratonovich R.L. Topics in the Theory of Random Noise. Volume 2 Macmillan Education; South Yarra, Australia: 1967. [Google Scholar]

[B34-entropy-28-00087] 34.Esser C., Jaffard S., Vedel B. Regularity properties of random wavelet series. arXiv. 2023 doi: 10.1090/tpms/1205.2304.00811 [DOI] [Google Scholar]

[B35-entropy-28-00087] 35.Aubry J.M., Jaffard S. Emergent Nature. World Scientific; Singapore: 2002. Random wavelet series: Theory and applications. [DOI] [Google Scholar]

[B36-entropy-28-00087] 36.Adler R.J. On excursion sets, tube formulas and maxima of random fields. Ann. Appl. Probab. 2000;10:1–74. doi: 10.1214/aoap/1019737664. [DOI] [Google Scholar]

[B37-entropy-28-00087] 37.Krishnan S.R., Taylor J.E., Adler R.J. The Intrinsic geometry of some random manifolds. Electron. Commun. Probab. 2017;22:1–12. doi: 10.1214/16-ecp4763. [DOI] [Google Scholar]

[B38-entropy-28-00087] 38.Yuan R.S., Ma Y.A., Yuan B., Ao P. Lyapunov function as potential function: A dynamical equivalence. Chin. Phys. B. 2014;23:010505. doi: 10.1088/1674-1056/23/1/010505. [DOI] [Google Scholar]

[B39-entropy-28-00087] 39.Parr T., Da Costa L., Friston K. Markov blankets, information geometry and stochastic thermodynamics. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2019;378:20190159. doi: 10.1098/rsta.2019.0159. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B40-entropy-28-00087] 40.Wang H., Hu W., Gan X., Ao P. The generalized Lyapunov function as Ao’s potential function: Existence in dimensions 1 and 2. J. Appl. Anal. Comput. 2023;13:359–375. doi: 10.11948/20220149. [DOI] [Google Scholar]

[B41-entropy-28-00087] 41.Khalil H.K. Nonlinear Control. Pearson; Upper Saddle River, NJ, USA: 2014. [Google Scholar]

[B42-entropy-28-00087] 42.Fliess M. A note on the invertibility of nonlinear input-output differential systems. Syst. Control Lett. 1986;8:147–151. doi: 10.1016/0167-6911(86)90073-3. [DOI] [Google Scholar]

[B43-entropy-28-00087] 43.Fliess M., Lévine J., Martin P., Rouchon P. On Differentially Flat Nonlinear Systems. IFAC Proc. Vol. 1992;25:159–163. doi: 10.1016/s1474-6670(17)52275-2. [DOI] [Google Scholar]

[B44-entropy-28-00087] 44.Fliess M., Lévine J., Martin P., Rouchon P. A Lie-Bäcklund approach equivalence and flatness of nonlinear systems. IEEE Trans. Automat. Control. 1999;44:922–937. doi: 10.1109/9.763209. [DOI] [Google Scholar]

[B45-entropy-28-00087] 45.Sira-Ramírez H. Differentially Flat Systems. CRC Press; Boca Raton, FL, USA: 2004. [Google Scholar]

[B46-entropy-28-00087] 46.Hilbert D. Über den Begriff der Klasse von Differentialgleichungen. Math. Ann. 1912;73:95–108. doi: 10.1007/BF01456663. [DOI] [Google Scholar]

[B47-entropy-28-00087] 47.Cartan M. Sur l’équivalence absolue de certains systèmes d’équations différentielles et sur certaines familles de courbes. Bull. Soc. Math. Fr. 1914;2:12–48. doi: 10.24033/bsmf.938. [DOI] [Google Scholar]

[B48-entropy-28-00087] 48.Fuller S., Greiner B., Moore J., Murray R., van Paassen R., Yorke R. The Python Control Systems Library (python-control); Proceedings of the 60th IEEE Conference on Decision and Control (CDC); Austin, TX, USA. 14–17 December 2021; Piscataway, NJ, USA: IEEE; 2021. pp. 4875–4881. [Google Scholar]

[B49-entropy-28-00087] 49.Perrin D. Algebraic Geometry. Springer; London, UK: 2008. [DOI] [Google Scholar]

[B50-entropy-28-00087] 50.Eisenbud D., Harris J. The Geometry of Schemes. 1st ed. Springer; New York, NY, USA: 2000. Graduate Texts in Mathematics. [Google Scholar]

[B51-entropy-28-00087] 51.Agrios M. A Very Elementary Introduction to Sheaves. arXiv. 2022 doi: 10.48550/arXiv.2202.01379.2202.01379 [DOI] [Google Scholar]

[B52-entropy-28-00087] 52.Curry J. Sheaves, Cosheaves and Applications. arXiv. 20131303.3255 [Google Scholar]

[B53-entropy-28-00087] 53.Perrinet L.U., Adams R.A., Friston K.J. Active inference, eye movements and oculomotor delays. Biol. Cybern. 2014;108:777–801. doi: 10.1007/s00422-014-0620-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B54-entropy-28-00087] 54.Adams R.A., Aponte E., Marshall L., Friston K.J. Active inference and oculomotor pursuit: The dynamic causal modelling of eye movements. J. Neurosci. Methods. 2015;242:1–14. doi: 10.1016/j.jneumeth.2015.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B55-entropy-28-00087] 55.Parr T., Friston K.J. Active inference and the anatomy of oculomotion. Neuropsychologia. 2018;111:334–343. doi: 10.1016/j.neuropsychologia.2018.01.041. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B56-entropy-28-00087] 56.Parr T., Friston K.J. The computational pharmacology of oculomotion. Psychopharmacology. 2019;236:2473–2484. doi: 10.1007/s00213-019-05240-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B57-entropy-28-00087] 57.Harris D., Vine S., Wilson M., Arthur T. The Relationship Between Environmental Statistics and Predictive Gaze Behaviour During a Manual Interception Task: Eye Movements as Active Inference. Comput. Brain Behav. 2023;7:225–241. doi: 10.1007/s42113-023-00190-5. [DOI] [Google Scholar]

[B58-entropy-28-00087] 58.Wong A. Listing’s law: Clinical significance and implications for neural control. Surv. Ophthalmol. 2004;49:563–575. doi: 10.1016/S0039-6257(04)00134-1. [DOI] [PubMed] [Google Scholar]

[B59-entropy-28-00087] 59.Ekstrand B. Equations of motion for a two-axes gimbal system. IEEE Trans. Aerosp. Electron. Syst. 2001;37:1083–1091. doi: 10.1109/7.953259. [DOI] [Google Scholar]

[B60-entropy-28-00087] 60.Siciliano B., Khatib O., editors. Springer Handbook of Robotics. Springer International Publishing; Cham, Switzerland: 2016. [DOI] [Google Scholar]

[B61-entropy-28-00087] 61.Fliess M., Join C., Moussa K., Djouadi S.M., Alsager M.W. Toward simple in silico experiments for drugs administration in some cancer treatments. IFAC-PapersOnLine. 2021;54:245–250. doi: 10.1016/j.ifacol.2021.10.263. [DOI] [Google Scholar]

[B62-entropy-28-00087] 62.Sira-Ramirez H., Luviano-Juárez A., Ramírez-Neria M., Zurita-Bustamante E.W. Active Disturbance Rejection Control of Dynamic Systems: A Flatness Based Approach. Butterworth-Heinemann; Oxford, UK: 2017. [Google Scholar]

[B63-entropy-28-00087] 63.Sira-Ramírez H., Gómez-León B.C., Aguilar-Orduña M.A. A two stage Active Disturbance Rejection Control design for under-actuated nonlinear systems. IFAC-PapersOnLine. 2023;56:4539–4544. doi: 10.1016/j.ifacol.2023.10.949. [DOI] [Google Scholar]

[B64-entropy-28-00087] 64.Utkin V.I. Sliding Modes in Control and Optimization. Springer; Berlin/Heidelberg, Germany: 1992. [DOI] [Google Scholar]

[B65-entropy-28-00087] 65.Shtessel Y., Edwards C., Fridman L., Levant A. Sliding Mode Control and Observation. Springer; New York, NY, USA: 2014. [DOI] [Google Scholar]

[B66-entropy-28-00087] 66.Fliess M., Join C. Model-free control. Int. J. Control. 2013;86:2228–2252. doi: 10.1080/00207179.2013.810345. [DOI] [Google Scholar]

[B67-entropy-28-00087] 67.Fliess M., Mounier H. Controllability and observability of linear delay systems: An algebraic approach. ESAIM Control. Optim. Calc. Var. 1998;3:301–314. doi: 10.1051/cocv:1998111. [DOI] [Google Scholar]

[B68-entropy-28-00087] 68.Mounier H., Rudolph J. Flatness-based control of nonlinear delay systems: A chemical reactor example. Int. J. Control. 1998;71:871–890. doi: 10.1080/002071798221614. [DOI] [Google Scholar]

[B69-entropy-28-00087] 69.Mounier H., Rudolph J. Flatness and quasi-static state feedback in non-linear delay systems. Int. J. Control. 2008;81:445–456. doi: 10.1080/00207170701579437. [DOI] [Google Scholar]

[B70-entropy-28-00087] 70.Estrada-Sánchez I., Velasco-Villa M., Rodríguez-Cortés H. Prediction-Based Control for Nonlinear Systems with Input Delay. Math. Probl. Eng. 2017;2017:7415418. doi: 10.1155/2017/7415418. [DOI] [Google Scholar]

[B71-entropy-28-00087] 71.Krstic M. Input Delay Compensation for Forward Complete and Strict-Feedforward Nonlinear Systems. IEEE Trans. Autom. Control. 2010;55:287–303. doi: 10.1109/TAC.2009.2034923. [DOI] [Google Scholar]

[B72-entropy-28-00087] 72.Bresch-Pietri D., Petit N., Krstic M. Prediction-based control for nonlinear state- and input-delay systems with the aim of delay-robustness analysis; Proceedings of the 2015 54th IEEE Conference on Decision and Control (CDC); Osaka, Japan. 15–18 December 2015; Piscataway, NJ, USA: IEEE; 2015. pp. 6403–6409. [DOI] [Google Scholar]

[B73-entropy-28-00087] 73.Karafyllis I., Krstic M. Predictor Feedback for Delay Systems: Implementations and Approximations. Springer International Publishing; Cham, Switzerland: 2017. [DOI] [Google Scholar]

[B74-entropy-28-00087] 74.Rodino . Linear Partial Differential Operators in Gevrey Spaces. World Scientific; Singapore: 1993. pp. 5–59. [DOI] [Google Scholar]

[B75-entropy-28-00087] 75.Balser W. Formal Power Series and Linear Systems of Meromorphic Ordinary Differential Equations. Springer; New York, NY, USA: 2000. [DOI] [Google Scholar]

[B76-entropy-28-00087] 76.Teofanov N., Tomić F., Žigić M. An introduction to extended Gevrey regularity. arXiv. 2024 doi: 10.3390/axioms13060352.2404.17366 [DOI] [Google Scholar]

[B77-entropy-28-00087] 77.Lindquist A., Picci G. On the Stochastic Realization Problem. SIAM J. Control Optim. 1979;17:365–389. doi: 10.1137/0317028. [DOI] [Google Scholar]

[B78-entropy-28-00087] 78.Veeravalli T., Raginsky M. Revisiting Stochastic Realization Theory using Functional Itô Calculus. arXiv. 2024 doi: 10.1016/j.ifacol.2024.10.190.2402.10157 [DOI] [Google Scholar]

[B79-entropy-28-00087] 79.van Nieuwstadt M., Rathinam M., Murray R. Differential flatness and absolute equivalence; Proceedings of the 1994 33rd IEEE Conference on Decision and Control; Orlando FL, USA. 14–16 December, 1994; Piscataway, NJ, USA: IEEE; 1994. CDC-94. [DOI] [Google Scholar]

[B80-entropy-28-00087] 80.Olver P.J. Equivalence, Invariants and Symmetry. Cambridge University Press; Cambridge, UK: 1995. [DOI] [Google Scholar]

[B81-entropy-28-00087] 81.Ivey T.A.t.A. Cartan for Beginners. American Mathematical Society; Providence, RI, USA: 2003. Graduate Studies in Mathematics. [Google Scholar]

[B82-entropy-28-00087] 82.Pio-Lopez L., Kuchling F., Tung A., Pezzulo G., Levin M. Active inference, morphogenesis, and computational psychiatry. Front. Comput. Neurosci. 2022;16:988977. doi: 10.3389/fncom.2022.988977. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B83-entropy-28-00087] 83.Badcock P.B., Davey C.G. Active Inference in Psychology and Psychiatry: Progress to Date? Entropy. 2024;26:833. doi: 10.3390/e26100833. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B84-entropy-28-00087] 84.Harris H.W. Active inference and psychodynamics: A novel integration with applications to depression and stress disorders. Front. Psychiatry. 2025;16:1630858. doi: 10.3389/fpsyt.2025.1630858. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B85-entropy-28-00087] 85.Adams R.A., Stephan K.E., Brown H.R., Frith C.D., Friston K.J. The Computational Anatomy of Psychosis. Front. Psychiatry. 2013;4:47. doi: 10.3389/fpsyt.2013.00047. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B86-entropy-28-00087] 86.Adams R.A., Perrinet L.U., Friston K. Smooth Pursuit and Visual Occlusion: Active Inference and Oculomotor Control in Schizophrenia. PLoS ONE. 2012;7:e47502. doi: 10.1371/journal.pone.0047502. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B87-entropy-28-00087] 87.Perrinet L.U., Adams R.A., Friston K. Active inference, eye movements and oculomotor delays. BMC Neurosci. 2013;14:P133. doi: 10.1186/1471-2202-14-S1-P133. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B88-entropy-28-00087] 88.Chelouah A. Extensions of differential flat fields and Liouvillian systems; Proceedings of the 36th IEEE Conference on Decision and Control; San Diego, CA, USA. 12 December 1997; Piscataway, NJ, USA: IEEE; 1997. pp. 4268–4273. CDC-97. [DOI] [Google Scholar]

[B89-entropy-28-00087] 89.Chelouah A. Diffieties and Liouvillian Systems. arXiv. 2010 doi: 10.48550/arXiv.1010.3909.1010.3909 [DOI] [Google Scholar]

[B90-entropy-28-00087] 90.Chetverikov V.N. Liouville systems and symmetries. Differ. Equ. 2012;48:1639–1651. doi: 10.1134/S0012266112120099. [DOI] [Google Scholar]

[B91-entropy-28-00087] 91.Nicolau F., Mounier H., Androulakis I.P. HPA axis differential flatness and Liouvillian study for higher resiliency investigations. IMA J. Math. Control Inf. 2023;40:746–788. doi: 10.1093/imamci/dnad030. [DOI] [Google Scholar]

[B92-entropy-28-00087] 92.Nicolau F., Mounier H. Flatness of Networks of Synaptically Coupled Excitatory-Inhibitory Neural Modules. ESAIM Control. Optim. Calc. Var. 2023;29:89. doi: 10.1051/cocv/2023082. [DOI] [Google Scholar]

[B93-entropy-28-00087] 93.Karrer T.M., Kim J.Z., Stiso J., Kahn A.E., Pasqualetti F., Habel U., Bassett D.S. A practical guide to methodological considerations in the controllability of structural brain networks. J. Neural Eng. 2020;17:026031. doi: 10.1088/1741-2552/ab6e8b. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B94-entropy-28-00087] 94.Baggio G., Pasqualetti F., Zampieri S. Energy-Aware Controllability of Complex Networks. Annu. Rev. Control. Robot. Auton. Syst. 2022;5:465–489. doi: 10.1146/annurev-control-042920-014957. [DOI] [Google Scholar]

[B95-entropy-28-00087] 95.Nandi S., Singh T. Global Sensitivity Analysis on the Bergman Minimal Model. IFAC-PapersOnLine. 2020;53:16112–16118. doi: 10.1016/j.ifacol.2020.12.431. [DOI] [Google Scholar]

[B96-entropy-28-00087] 96.Bergman R.N. Origins and History of the Minimal Model of Glucose Regulation. Front. Endocrinol. 2021;11:583016. doi: 10.3389/fendo.2020.583016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B97-entropy-28-00087] 97.Rudolph J. Well-formed dynamics under quasi-static state feedback. Banach Cent. Publ. 1995;32:349–360. doi: 10.4064/-32-1-349-360. [DOI] [Google Scholar]

[B98-entropy-28-00087] 98.Crespo T., Hajto Z., Mohseni R. Real Liouvillian extensions of partial differential fields. Symmetry Integr. Geom. Methods Appl. 2021;17:095. doi: 10.3842/SIGMA.2021.095. [DOI] [Google Scholar]

[B99-entropy-28-00087] 99.Besançon G. Nonlinear Observers and Applications. Springer; Berlin/Heidelberg, Germany: 2007. [DOI] [Google Scholar]

[B100-entropy-28-00087] 100.Isidori A. Nonlinear Control Theory for Automation. In: Nof S.Y., editor. Springer Handbook of Automation. Springer International Publishing; Cham, Switzerland: 2023. pp. 163–187. [DOI] [Google Scholar]

[B101-entropy-28-00087] 101.Diop S. Some Control Observation Problems and Their Differential Algebraic Partial Solutions. In: Quadrat A., Zerz E., editors. Algebraic and Symbolic Computation Methods in Dynamical Systems. Springer International Publishing; Cham, Switzerland: 2020. pp. 147–160. [DOI] [Google Scholar]

[B102-entropy-28-00087] 102.Sira-Ramirez H.J., Garcia Rodriguez C., Cortes Romero J.A., Luviano Juarez A. Algebraic Identification and Estimation Methods in Feedback Control Systems. John Wiley & Sons; Nashville, TN, USA: 2014. Wiley Series in Dynamics and Control of Electromechanical Systems. [Google Scholar]

[B103-entropy-28-00087] 103.Join C., Delaleau E., Fliess M. Flatness-based control revisited: The HEOL setting. Comptes Rendus. Mathématique. 2024;362:1693–1706. doi: 10.5802/crmath.674. [DOI] [Google Scholar]

[B104-entropy-28-00087] 104.Greco L., Mounier H., Bekcheva M. An approximate characterisation of the set of feasible trajectories for constrained flat systems. Automatica. 2022;144:110484. doi: 10.1016/j.automatica.2022.110484. [DOI] [Google Scholar]

[B105-entropy-28-00087] 105.Beaver L.E., Malikopoulos A.A. Optimal control of differentially flat systems is surprisingly easy. Automatica. 2024;159:111404. doi: 10.1016/j.automatica.2023.111404. [DOI] [Google Scholar]

[B106-entropy-28-00087] 106.Join C., Delaleau E., Fliess M. The Euler-Lagrange Equation and Optimal Control: Preliminary Results; Proceedings of the 2024 12th International Conference on Systems and Control (ICSC); Batna, Algeria. 3–5 November 2024; Piscataway, NJ, USA: IEEE; 2024. pp. 155–160. [DOI] [Google Scholar]

[B107-entropy-28-00087] 107.Join C., Delaleau E., Fliess M. Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs. IFAC-PapersOnLine. 2025;59:255–260. doi: 10.1016/j.ifacol.2025.10.044. [DOI] [Google Scholar]

[B108-entropy-28-00087] 108.Richalet J., Rault A., Testud J., Papon J. Model predictive heuristic control: Applications to industrial processes. Automatica. 1978;14:413–428. doi: 10.1016/0005-1098(78)90001-8. [DOI] [Google Scholar]

[B109-entropy-28-00087] 109.Mayne D.Q. Model predictive control: Recent developments and future promise. Automatica. 2014;50:2967–2986. doi: 10.1016/j.automatica.2014.10.128. [DOI] [Google Scholar]

PERMALINK

Active Inference and Functional Parametrisation: Differential Flatness and Smooth Random Realisation

Hugues Mounier

Thomas Parr

Karl Friston

Roles

Abstract

1. Introduction

2. Preliminary Notions: Generative Model, Action, State, and Fluctuation Choice

2.1. Generative Models

Definition 1 (Generative model).

Definition 2 (Mean generative model).

2.2. Action, Output, and State

Definition 3 (action (control input) and output).

Remark 1 (Local genericity).

Definition 4 (state).

Definition 5 (State representation).

Definition 6 (Realisation).

2.3. Fluctuation Choice

Remark 2 (Wavelet random series).

Example 1 (Smooth random functions).

Figure 1.

Example 2 (Simple generic example; model).

3. Free Energy, Flatness, and Conceptual Similarities

3.1. Free and Expected Free Energy

Table 1.

3.2. Differential Flatness

3.2.1. Controllability

Definition 7.

3.2.2. Motivation Through Observation and Action

3.2.3. Motivation Through Direct and Inverse Views

3.2.4. Formal Definition

Definition 8 (Differential flatness).

Remark 3 (Differential flatness–state-space form case).

Remark 4.

Remark 5 (Local genericity).

Proposition 1.

Remark 6 (Tracking an arbitrary output).

3.2.5. Functional Parameterisation

3.3. Conceptual Similarities

Example 3 (Simple generic example; similarities).

4. References, Flatness-Based Trajectory Tracking, and Perceptual and Active Inferences

4.1. Equivalence to Linearity

4.1.1. Differential Flatness Characterisation

Proposition 2.

Definition 9.

4.1.2. Dynamical Extension Algorithm

Phase I—Weak Brunovský Index Gathering

Phase II—Flatness Character Determination

Phase III—Linearizing Feedback

4.2. Differential Flatness and Controllability

Proposition 3.

Proof.

Example 4 (Oculomotor dynamics and flatness).

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

4.3. Trajectory Design and Planning

4.4. Synthesis Law Computations: Tracking Controller

4.4.1. General Action Tracking Law

Remark 7 (Open-loop and model-free).

Example 5 (Simple generic example; tracking).

4.4.2. Oculomotor Example

Example 6 (Oculomotor tracking).

Remark 8 (A more realistic example).

4.4.3. Simulations

Figure 8.

Figure 9.

Figure 10.

Figure 11.

4.5. Active Inference

4.6. Link with Flatness-Based Tracking

Example 7 (Oculomotor tracking control law).

5. Prediction as a Link Between Active Inference and Differential Flatness

5.1. Delays and δ-Flatness

Example 8.

5.2. Trajectory Tracking and Predictors

5.1. Delays and $δ$ -Flatness