Data-driven linearization of dynamical systems

George Haller; Bálint Kaszás

doi:10.1007/s11071-024-10026-x

. 2024 Aug 15;112(21):18639–18663. doi: 10.1007/s11071-024-10026-x

Data-driven linearization of dynamical systems

George Haller ^1,^✉, Bálint Kaszás ¹

PMCID: PMC11362512 PMID: 39219721

Abstract

Dynamic mode decomposition (DMD) and its variants, such as extended DMD (EDMD), are broadly used to fit simple linear models to dynamical systems known from observable data. As DMD methods work well in several situations but perform poorly in others, a clarification of the assumptions under which DMD is applicable is desirable. Upon closer inspection, existing interpretations of DMD methods based on the Koopman operator are not quite satisfactory: they justify DMD under assumptions that hold only with probability zero for generic observables. Here, we give a justification for DMD as a local, leading-order reduced model for the dominant system dynamics under conditions that hold with probability one for generic observables and non-degenerate observational data. We achieve this for autonomous and for periodically forced systems of finite or infinite dimensions by constructing linearizing transformations for their dominant dynamics within attracting slow spectral submanifolds (SSMs). Our arguments also lead to a new algorithm, data-driven linearization (DDL), which is a higher-order, systematic linearization of the observable dynamics within slow SSMs. We show by examples how DDL outperforms DMD and EDMD on numerical and experimental data.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11071-024-10026-x.

Introduction

In recent years, there has been an overwhelming interest in devising linear models for dynamical systems from experimental or numerical data (see the recent review by Schmid [51]). This trend was largely started by the dynamic mode decomposition (DMD), put forward in seminal work by Schmid [50]. The original exposition of the method has been streamlined by various authors, most notably by Rowley et al. [49] and Kutz et al. [32].

To describe DMD, we consider an autonomous dynamical system

\begin{matrix} \dot{x} = f (x), x \in R^{n}, f \in C^{1} (R^{n}), \end{matrix}

for some $n \in N^{+}$ . Trajectories ${\{x (t ; x_{0})\}}_{t \in R}$ of this system evolve from initial conditions $x_{0}$ . The flow map $F^{t} : R^{n} \to R^{n}$ is defined as the mapping taking the initial trajectory positions at time $t_{0} = 0$ to current ones at time t, i.e.,

\begin{matrix} F^{t} (x_{0}) = x (t ; x_{0}) . \end{matrix}

As observations of the full state space variable $x$ of system (1) are often not available, one may try to explore the dynamical system (1) by observing d smooth scalar functions $ϕ_{1} (x), \dots, ϕ_{d} (x)$ along trajectories of the system. We order these scalar observables into the observable vector

\begin{matrix} ϕ (x) = (\begin{matrix} ϕ_{1} (x) \\ ⋮ \\ ϕ_{d} (x) \end{matrix}) \in C^{1} (R^{n}) \end{matrix}

The basic idea of DMD is to approximate the observed evolution of $ϕ (F^{t}, (x_{0}))$ of the dynamical system with the closest fitting autonomous linear dynamical system

\begin{matrix} \dot{ϕ} = L ϕ, L \in R^{d \times d}, \end{matrix}

based on available trajectory observations.

This is a challenging objective for multiple reasons. First, the original dynamical system (1) is generally nonlinear whose dynamics cannot be well approximated by a single linear system on a sizable open domain. For instance, one may have several isolated, coexisting attracting or repelling stationary states (such as periodic orbits, limit cycles or quasiperiodic tori), which linear systems cannot have. Second, it is unclear why the dynamics of d observables should be governed by a self-contained autonomous dynamical system induced by the original system (1), whose dimension is n. Third, the result of fitting system (4) to observable data will clearly depend on the initial conditions used, the number and the functional form of the observables chosen, as well as on the objective function used in minimizing the fit.

Despite these challenges, we may proceed to find an appropriately defined closest linear system (4) based on available observable data. We assume that for some fixed time step $Δ t$ , discrete observations of m initial conditions, $x^{1} (t_{0}), \dots, x^{m} (t_{0})$ , and their images $F^{Δ t} (x^{1} (t_{0})), \dots, F^{Δ t} (x^{m}, (t_{0}))$ , under the sampled flow map $F^{Δ t}$ are available in the data matrices

\begin{matrix} Φ = & [ϕ (x^{1}, (t_{0})), \dots, ϕ (x^{m}, (t_{0}))], \\ \hat{Φ} = & [ϕ (F^{Δ t}, (x^{1} (t_{0}))), \dots, ϕ (F^{Δ t}, (x^{m}, (t_{0})))], \end{matrix}

respectively. We seek the best fitting linear system of the form (4) for which

\begin{matrix} \hat{Φ} \approx D Φ, D = e^{L Δ t}, \end{matrix}

holds. The eigenvalues of such a $D$ are usually called DMD eigenvalues, and their corresponding eigenvectors are called the DMD modes.

Various norms can be chosen with respect to which the difference of $\hat{Φ}$ and $D Φ$ is to be minimized. The most straightforward choice is the Euclidean matrix norm $|\cdot|$ , which leads to the minimization principle

\begin{matrix} D^{*} = \underset{D \in R^{d \times d}}{argmin {|\hat{Φ} - D Φ|}^{2}} . \end{matrix}

An explicit solution to this problem is given by

\begin{matrix} D = (\hat{Φ}, Φ^{T}) {(\hat{Φ}, Φ^{T})}^{†}, \end{matrix}

with the dagger referring to the pseudo-inverse of a matrix (see, e.g., Kutz et al. [32] for details). We note that the original formulation of Schmid [50] is for discrete dynamical processes and assumes observations of a single trajectory (see also Rowley et al. [49]).

Among several later variants of DMD surveyed by Schmid [51], the most broadly used one is the Extended Dynamic Mode Decomposition (EDMD) of Williams et al. [58]. This procedure seeks the best-fitting linear dynamics for an a priori unknown set of functions $K (ϕ (x))$ of $ϕ (x)$ , rather than for $ϕ (x)$ itself. In practice, one often chooses $K$ as an N(d, k)-dimensional vector of d-variate scalar monomials of order k or less, where $N (d, k) = (\begin{matrix} d + k \\ k \end{matrix})$ is the total number of all such monomials. The underlying assumption of EDMD is that a self-contained linear dynamical system of the form

\begin{matrix} \frac{d}{dt} K (ϕ (F^{t} (x_{0}))) = L K (ϕ (F^{t} (x_{0}))) \end{matrix}

can be obtained on the feature space $R^{N (d, k)}$ by optimally selecting $L \in R^{N (d, k) \times N (d, k)}$ . For physical systems, the N(d, k)-dimensional ODE in Eq. (9) defined on the feature space $R^{N (d, k)}$ can be substantially higher-dimensional than the d-dimensional ODE (4). In fact, N(d, k) may be substantially higher than the dimension n of the phase space $R^{n}$ of the original nonlinear system (1).

Once the function library used in EDMD is fixed, one again seeks to choose $L$ so that

\begin{matrix} K (\hat{Φ}) \approx D K (Φ), D = e^{L Δ t} . \end{matrix}

This again leads to a linear optimization problem that can be solved using linear algebra tools. For higher-dimensional systems, a kernel-based version of EDMD was developed by Williams et al. [59]. This method computes inner products necessary for EDMD implicitly, without requiring an explicit representation of (polynomial) basis functions in the space of observables. As a result, kernel-based EDMD operates at computational costs comparable to those of the original DMD.

Prior justifications for DMD methods

Available justifications for DMD (see [49]) and EDMD (see [58]) are based on the Koopman operator, whose basics we review in Appendix A.1 for completeness. The argument starts with the observation that special observables falling in invariant subspaces of this operator in the space of all observables obey linear dynamics. Consequently, DMD should recover the Koopman operator restricted to this subspace if the observables are taken from such a subspace.

In this sense, DMD is viewed as an approximate, continuous immersion of a nonlinear system into an infinite dimensional linear dynamical system. While such an immersion is not possible for typical nonlinear systems with multiple limit sets (see [38, 39]), one still hopes that this approximate immersion is attainable via DMD or EDMD for nonlinear systems with a single attracting steady state that satisfies appropriate nondegeneracy conditions (see [33]). In that case, unlike classic local linearization near fixed points, the linearization via DMD or EDMD is argued to be non-local, as it covers the full domain of definition of Koopman eigenfunctions spanning the underlying Koopman-invariant subspace.

However, Koopman eigenfunctions, whose existence, domain of definition and exact form are a priori unknown for general systems, are notoriously difficult–if not impossible–to determine accurately from data. More importantly, even if Koopman-invariant subspaces of the observable space were known, any countable set of generically chosen observables would lie outside those subspaces with probability one. As a consequence, DMD eigenvectors (which are generally argued to be approximations of Koopman eigenfunctions and can be used to compute Koopman modes1) would also lie outside Koopman-invariant subspaces, given that such eigenvectors are just linear combinations of the available observables. Consequently, practically observed data sets would fall under the realm of Koopman-based explanation for DMD with probability zero. This is equally true for EDMD, whose flexibility in choosing the function set $K (ϕ (x))$ of observables also introduces further user-dependent heuristics beyond the dimension d of the DMD.

One may still hope that by enlarging the dimension d of observables in DMD and enlarging the function library $K (ϕ (x))$ in EDMD, the optimization involved in these methods brings DMD and EDMD eigenvectors closer and closer to Koopman eigenfunctions. The required enlargements, however, may mean hundreds or thousand of dimensions even for dynamical system governed by simple, low-dimensional ODEs [59]. These enlargements succeed in fitting linear systems closely to sets of observer trajectories, but they also unavoidably lead to overfits that give unreliable predictions for initial conditions not used in the training of DMD or EDMD. Indeed, the resulting large linear systems can perform substantially worse in prediction than much lower dimensional linear or nonlinear models obtained from other data-driven techniques (see, e.g., Alora et al. [1]).

Similar issues arise in justifying the kernel-based EDMD of Williams et al. [59] based on the Koopman operator. Additionally, the choice of the kernel function that represents the inner product of the now implicitly defined polynomial basis functions remains heuristic and problem-dependent. Again, the accuracy of the procedure is not guaranteed, as available observer data is generically not in a Koopman eigenspace. As Williams et al. [59] write: “Like most existing data-driven methods, there is no guarantee that the kernel approach will produce accurate approximations of even the leading eigenvalues and modes, but it often appears to produce useful sets of modes in practice if the kernel and truncation level of the pseudoinverse are chosen properly.”

Finally, a lesser known limitation of the Koopman-based approach to DMD is the limited domain in the phase space over which Koopman eigenfunctions (and hence their corresponding invariant subspaces) are defined in the observable space. Specifically, at least one principal Koopman eigenfunction necessarily blows up near basin boundaries of attracting and repelling fixed points and periodic orbits (see Proposition 1 of our Appendix A.4 for a precise statement and Theorem 3 of Kvalheim and Arathoon [33] for a more general related result).

Expansions of observables in terms of such blowing-up eigenfunctions have even smaller domains of convergence, as was shown explicitly in a simple example by Page and Kerswell [44]. This is a fundamental obstruction to the often envisioned concept of global linear models built of different Koopman eigenfunctions over multiple domains of attraction (see, e.g., Williams et al. [58], p. 1309). While it is broadly known that such models would be discontinuous along basin boundaries [33, 38, 39], it is rarely noted (see Kvalheim and Arathoon [33] for a rare exception) that such models would also generally blow up at those boundaries and hence would become unmanageable even before reaching the boundaries.

For these reasons, an alternative mathematical foundation for DMD is desirable. Ideally, such an approach should be defined on an equal or lower dimensional space, rather than on higher or even infinite-dimensional spaces, as suggested by the Koopman-based view on DMD. This should help in avoiding overfitting and computational difficulties. Additionally, an ideal treatment of DMD should also provide specific non-degeneracy conditions on the underlying dynamical system, on the available observables, and on the specific data to be used in the DMD procedure.

In this paper, we develop a treatment of DMD that satisfies these requirements. This enables to derive conditions for DMD to approximate the dominant linearized observable dynamics near hyperbolic fixed points and periodic orbits of finite- and infinite-dimensional dynamical systems.

Our approach to DMD also leads to a refinement of DMD which we call data-driven linearization (or DDL, for short). DDL effectively carries out exact local linearization via nonlinear coordinate changes on a lower-dimensional attracting invariant manifold (spectral submanifold) of the dynamical system. We illustrate the increased accuracy and domain of validity of DDL models relative to those obtained from DMD and EDMD on examples of autonomous and forced dynamical systems.

A simple justification for the DMD algorithm

Here we give an alternative interpretation of DMD and EDMD as approximate models for a dynamical system known through a set of observables. The main idea (to be made more precise shortly) is to view DMD executed on d observables $ϕ_{1}, \dots, ϕ_{d}$ as a model reduction tool that captures the leading-order dynamics of $n \geq d$ phase space variables along a d-dimensional slow manifold in terms of $ϕ_{1}, \dots, ϕ_{d}$ .

Such manifolds arise as slow spectral submanifolds (SSMs) under weak non-degeneracy assumptions on the linearized spectrum at stable hyperbolic fixed points of the n-dimensional dynamical system (see [10, 12, 22]). Specifically, a slow SSM $W (E)$ is tangent to the real eigenspace E spanned by the d slowest decaying linearized modes at the fixed point. If m sample trajectories ${\{x^{j}, (t)\}}_{j = 1}^{m}$ are released from a set of initial conditions ${\{x^{j}, (0)\}}_{j = 1}^{m}$ at time $t = 0$ , then due to their fast decay along the remaining fast spectral subspace F , the $x^{j} (t)$ trajectories will become exponentially close to $W (E)$ by some time $t = t_{0} \geq 0$ , and closely synchronize with its internal dynamics, as seen in Fig. 1.

Fig. 1 — The geometric meaning of DMD performed on d observables, $ϕ_{1} (x), \dots, ϕ_{d} (x)$ , after initial fast transients die out at an exponential rate in the data. DMD then identifies the leading-order (linear) dynamics on a d-dimensional attracting spectral submanifold (SSM) $W (E)$ tangent to the d-dimensional slow spectral subspace E. These linearized dynamics can be expressed in terms of the SSM-restricted observables ${φ = ϕ |}_{W (E)}$ . Also shown is the spectral subspace F of the faster decaying modes and its associated nonlinear continuation, the fast spectral subspace $W (F)$

If $W (E)$ admits a non-degenerate parametrization with respect to the observables $ϕ_{1}, \dots, ϕ_{d}$ , then one can pass to these observables as new coordinates in which to approximate the leading order, linearized dynamics inside $W (E)$ . Specifically, DMD executed over time the interval $[t_{0}, t_{0} + Δ t]$ provides the closest linear fit to the reduced dynamics of $W (E)$ from the available observable histories $ϕ_{1} (x^{j}, (t)), \dots, ϕ_{d} (x^{j}, (t))$ for $t \in [t_{0}, t_{0} + Δ t]$ , as illustrated in Fig. 1. This fit is a close representation of the actual linearized dynamics on the SSM $W (E)$ if the trajectory data ${\{x^{j}, (t)\}}_{j = 1}^{m}$ is sufficiently diverse for $t \geq t_{0}$ . The resulting DMD model with coefficient matrix $D$ will then be smoothly conjugate to the linearized reduced dynamics on $W (E)$ with an error of the order of the distance of ${\{x^{j}, (t)\}}_{j = 1}^{m}$ from $W (E)$ between the times $t_{0}$ and $t_{0} + Δ t$ .

The slow SSM $W (E)$ may also contain unstable modes in applications. Similar results hold for that case as well, provided that we select a small enough $Δ t$ , ensuring that the trajectories ${\{x^{j}, (0)\}}_{j = 1}^{m}$ are not ejected from the vicinity of the fixed point origin for $t \in [t_{0}, t_{0} + Δ t]$ . As DMD will show sensitivity with respect to the choice of $t_{0}$ and $Δ t$ in this case, we will not discuss the justification of DMD near unstable fixed points beyond Remarks 1 and 7.

In the following sections, we make this basic idea more precise both for finite and infinite-dimensional dynamical systems. This approach also reveals explicit, previously undocumented non-degeneracy conditions on the underlying dynamical system, on the available observable functions and on the specific data set used, under which DMD should give meaningful results.

Justification of DMD for continuous dynamical systems

We start by assuming that the observed dynamics take place in a domain containing a fixed point, which is assumed to be at the origin, without loss of generality, i.e.,

\begin{matrix} f (0) = 0 . \end{matrix}

We can then rewrite the dynamical system (1) in the more specific form

\begin{matrix} \dot{x} = & Ax + \tilde{f} (x), x \in R^{n}, A = D f (0), \\ \tilde{f} (x) = & o (|x|), \end{matrix}

where the classic $o (|x|)$ notation refers to the fact that ${lim}_{x \to 0} [|\tilde{f}, (x)| / |x|] = 0$ .

Most expositions of DMD methods and their variants do not state assumption (10) explicitly and hence may appear less restrictive than our treatment here. However, all of them implicitly assume the existence of such a fixed point, as all of them end up returning homogeneous linear ODEs or mappings with a fixed point at the origin. Indeed, all known applications of these methods that produce reasonable accuracy target the dynamics of ODEs or discrete maps near their fixed points.

Assumption (10) can be replaced with the existence of a limit cycle in the original system (1), in which case the first return map (or Poincaré map) defined near the limit cycle will have a fixed point. We give a separate treatment on justifying DMD as a linearization for such a Poincaré map in Sect. 3.2.

We do not advocate, however, the often used procedure of applying DMD to fit a linear system to the flow map (rather than the Poincaré map) near a stable limit cycle. Such a fit only produces the desired limiting periodic behavior if one or more of the DMD eigenvalues are artificially constrained to be on the complex unit circle by the user of DMD. This renders the DMD model both structurally unstable and conceptually inaccurate for prediction. Indeed, the model will approximate the originally observed limit cycle and convergence to it only within a measure zero, cylindrical set of its phase space. Outside this set, all trajectories of the DMD model will converge to some other member of the infinite family of periodic orbits or invariant tori within the center subspace corresponding to the unitary eigenvalues. These periodic orbits or tori have a continuous range of locations and amplitudes, and hence represent spurious asymptotic behaviors that are not seen in the original dynamical system (1).

In the special case of $d = n$ and for the special observable $Φ (x) = x$ , the near-linear form of Eq. (11) motivates the DMD procedure because a linear approximation to the system near $x = 0$ seems feasible. It is a priori unclear, however, to what extent the nonlinearities distort the linear dynamics and how DMD would account for that. Additionally, in a data-driven analysis, choosing the full phase space variable $x$ as the observable $ϕ (x)$ is generally unrealistic. For these reasons, a mathematical justification of DMD requires further assumptions, as we discuss next.

Let $λ_{1}, \dots, λ_{n} \in C$ denote the eigenvalues of $A$ and let $e_{1}, \dots, e_{n} \in C^{n}$ denote the corresponding generalized eigenvectors. We assume that at least one of the modes of the linearized system at $x = 0$ decays exponentially and there is at least one other mode that decays slower or even grows. More specifically, for some positive integer $d < n$ , we assume that the spectrum of A can be partitioned as

\begin{matrix} Re λ_{n} \leq \dots \leq Re λ_{d + 1} < Re λ_{d} \leq \dots \leq Re λ_{1} < 0 . \end{matrix}

This guarantees the existence of a d-dimensional, normally attracting slow spectral subspace

\begin{matrix} E = span \{Re e_{1}, Im e_{1}, \dots, Re e_{d}, Im e_{d}\} \end{matrix}

for the linearized dynamics, with linear decay rate towards E strictly dominating all decay rates inside E. Note that the set of vectors $Re e_{1}, Im e_{1}, \dots, Re e_{d}, Im e_{d}$ is, in general, not linearly independent, but they span a d-dimensional subspace. We also define the (real) spectral subspace of faster decaying linear modes:

\begin{matrix} F = span \{Re e_{d + 1}, Im e_{d + 1}, \dots, Re e_{n}, Im e_{n}\} . \end{matrix}

We will also use matrices containing the left and right eigenvectors of the operator $A$ and of its restrictions, ${A |}_{E}$ and ${A |}_{F}$ , to its spectral subspaces E and F, respectively. Specifically, we let

\begin{matrix} T & = [T_{E}, T_{F}], P = [\begin{matrix} P_{E} \\ P_{F} \end{matrix}], \end{matrix}

where the columns of $T_{E} \in R^{n \times d}$ are the real and imaginary parts of the generalized right eigenvectors of ${A |}_{E}$ and the columns of $T_{F} \in R^{n \times (n - d)}$ are defined analogously for ${A |}_{F} .$ Similarly, the rows of $P_{E} \in R^{d \times n}$ are the real and imaginary parts of the generalized left eigenvectors of ${A |}_{E}$ and the rows of $P_{F} \in R^{(n - d) \times n}$ are defined analogously for ${A |}_{F} .$ Under assumption (12), F is always a fast spectral subspace, containing all trajectories of the linearized system that decay faster to the origin as any trajectory in E.

We will use the notation

\begin{matrix} X = & [x^{1} (t_{0}), \dots, x^{m} (t_{0})], \\ \hat{X} = & [F^{Δ t} (x^{1} (t_{0})), \dots, F^{Δ t} (x^{m}, (t_{0}))] \end{matrix}

for trajectory data in the underlying dynamical system (1) on which the observable data matrices $Φ$ and $\hat{Φ}$ defined in (5) are defined. In truly data-driven applications, the matrices $X$ and $\hat{X}$ are not known. We will nevertheless use them to make precise statements about a required dominance of the slow linear modes of E in the available data. Such a dominance will arise for generic initial conditions if one selects the initial conditions $x^{1} (t_{0}), \dots, x^{m} (t_{0})$ after initial fast transients along F have died out. This can be practically achieved by initializing $x^{1} (t_{0}), \dots, x^{m} (t_{0})$ after a linear spectral analysis of the observable data matrix $Φ$ returns a number of dominant frequencies consistent with a d-dimensional SSM.

We now state a theorem that provides a general justification for the DMD procedure under explicit nondegeneracy conditions and with specific error bounds. Specifically, we give a minimal set of conditions under which DMD can be justified as an approximate, leading-order, d-dimensional reduced-order model for an nonlinear system of dimension $n \geq d$ near its fixed point. Based on relevance for applications, we only state Theorem 1 for stable hyperbolic fixed points, but discuss subsequently in Remark 1 its extension to unstable fixed points.

Theorem 1

(Justification of DMD for ODEs with stable hyperbolic fixed points) Assume that

The origin, $x = 0$ , is a stable hyperbolic fixed point of system (11) with a spectral gap, i.e., the spectrum $Spect [A]$ satisfies Eq. (12).
$f \in C^{2}$ in a neighborhood of the origin.
For some integer $d \in [1, n]$ , a d-dimensional observable function $ϕ \in C^{2}$ and the d-dimensional slow spectral subspace E of the hyperbolic fixed point $x = 0$ of system (11) satisfy the non-degeneracy condition
$\begin{matrix} rank [D, ϕ, (0), |_{E}] = d . \end{matrix}$ 17
The data matrices $Φ$ and $\hat{Φ}$ are non-degenerate and the initial conditions in $X$ and $\hat{X}$ have been selected after fast transients from the modes outside E have largely died out, i.e.,
$\begin{matrix} {rank}_{row} Φ = d, |P_{F}, X|, |P_{F}, \hat{X}| \leq {|P_{E}, X|}^{1 + β}, \end{matrix}$ 18
for some $β \in (0, 1]$ .

Then the DMD computed from $Φ$ and $\hat{Φ}$ yields a matrix $D$ that is locally topologically conjugate with order $O ({|P_{E}, X|}^{β})$ error to the linearized dynamics on a d-dimensional, slow attracting spectral submanifold $W (E) \in C^{1}$ tangent to E at $x = 0$ . Specifically, we have

\begin{matrix} D = D ϕ (0) T_{E} e^{Λ_{E} Δ t} {(D, ϕ, (0), T_{E})}^{- 1} + O ({|P_{E}, X|}^{β}) . \end{matrix}

Proof

Under assumptions (A1) and (A2), any trajectory in a neighborhood of the origin in the nonlinear system (11) converges at an exponential rate $e^{Re λ_{d + 1} t}$ to a d-dimensional attracting spectral submanifold $W (E)$ tangent to a d-dimensional attracting slow spectral subspace E of the linearized system at the origin. This follows from the $C^{1}$ linearization theorem of Hartman [23], which is applicable to $C^{2}$ dynamical systems with a stable hyperbolic fixed point. Under assumption (A3), the d-dimensional observable function $ϕ (x)$ restricted to $W (E)$ can be used to parametrize $W (E)$ near the origin, and hence a d-dimensional, self-contained nonlinear dynamical system can be written down for the restricted observable ${φ = ϕ |}_{W (E)}$ along $W (E)$ . Under the first assumption in (A4), the available observational data matrices $Φ$ and $\hat{Φ}$ are rich enough to characterize the reduced dynamics on $W (E)$ . Under the second assumption in (A4), transients from the faster modes outside E have largely died out before the selection of the initial conditions in $X$ , so that the linear part of the dynamics on $W (E)$ can be approximately inferred from $Φ$ and $\hat{Φ}$ . In that case, up to an error proportional to the distance of the training data from $W (E)$ , the matrix $D \in R^{d \times d}$ returned by DMD is similar to the time- $Δ t$ flow map of the linearized flow of the underlying dynamical system restricted to $W (E)$ . This linearized flow then acts as a local reduced-order model with which nearby trajectory observations synchronize exponentially fast in the observable space. We give a more detailed proof of the theorem in Appendix (B). $□$

Remark 1

In Theorem 1, we can replace assumption (A1) with

\begin{matrix} Re λ_{n} \leq . . . \leq Re λ_{d + 1} < 0, \\ Re λ_{d + 1} < Re λ_{d} \leq . . . \leq Re λ_{1}, \\ Re λ_{j} \neq 0, j = 1, . . ., n \end{matrix}

This means that the $x = 0$ fixed point is only assumed hyperbolic with a spectral gap and $A$ has an attracting d-dimensional spectral subspace E that possibly contains some instabilities, i.e., eigenvalues with positive real parts. Then the statements of Theorem 1 still hold, but $W (E)$ will be only be guaranteed $C^{1}$ at $x = 0$ and Hölder-continuous at other points near the fixed point. This follows by replacing the linearization theorem of Hartman [23] with that of van Strien [56], which still enables us to use Eq. (79) in the proof. Therefore, slow subspaces E containing a mixture of stable and unstable modes can also be allowed, as long as F contains only fast modes consistent with the splitting assume in Eq. (12). In that case, however, the time $t_{0} + Δ t$ must be chosen carefully to ensure that $|P_{F}, \hat{X}| \leq {|P_{E}, X|}^{1 + β}$ still holds, i.e., the data used in DMD still samples a neighborhood of the origin.

Remark 2

In related work, Bollt et al. [7] construct the transformation relating a pair of conjugate dynamical systems based on a limited set of matching Koopman eigenfunctions, which are either known explicitly or constructed from EDMD with dictionary learning (EDMD-DL; see [36]). In principle, this could be used to construct linearizing transformation as well. However, even when the eigenfunctions are approximated from data, the approach assumes that the linearized system, as well as a linearized trajectory and its preimage under the linearization, are available. As these assumptions are not satisfied in practice, only very simple and low-dimensional analytic examples are treated by Bollt et al. [7].

In Appendix (B), Remarks 4 and 5 summarize technical points on the application and possible further extensions of Theorem 1. In practice, Theorem 1 provides previously unspecified non-degeneracy conditions on the linear part of the dynamical system to be analyzed via DMD (assumption (A1)), on the regularity of the nonlinear part of the system (assumption (A2)), on the type of observables available for the analysis (assumption (A3)) and on the specific observable data used in the analysis (assumption (A4)). The latter assumption requires that there have to be at least as many independent observations in time as observables. This specifically excludes the popular use of tall $Φ$ observable data matrices which provide more free parameters to pattern-match observational data but will also lead to an overfit that diminishes the predictive power of the DMD model on initial conditions not used in its training.

To illustrate these points, we demonstrate the necessity of assumptions (A2)–(A4) of Theorem 1 in Appendix C on simple examples.

Justification of DMD for discrete and for time-periodic continuous dynamical systems

The linearization results we have applied to deduce Theorem 1 are equally valid for discrete dynamical systems defined by iterated mappings. Such mappings are of the form

\begin{matrix} x_{n + 1} = & f (x_{n}) = A x_{n} + \tilde{f} (x_{n}), x_{j} \in R^{n}, \\ A \in R^{n \times n}, \tilde{f} (x) = o (|x|) . \end{matrix}

We will use a similar ordering for the eigenvalues of $A$ as in the continuous time case:

\begin{matrix} |λ_{n}| \leq \dots \leq |λ_{d + 1}| < |λ_{d}| \leq \dots \leq |λ_{1}| < 1 . \end{matrix}

As in the continuous time case, we will use the observable data matrices

\begin{matrix} Φ = ϕ (X), \hat{Φ} = ϕ (f, (X)), \end{matrix}

with the initial conditions for the map $f$ stored in $X$ .

With these ingredients, we need only minor modifications in the assumptions of the theorems that account for the usual differences between the spectrum of an ODE and a map.

Theorem 2

(Justification of DMD for maps with stable hyperbolic fixed points) Assume that

$x = 0$ is a stable hyperbolic fixed point of system (21), i.e., assumption (22) holds.
In Eq. (21), $\tilde{f} \in C^{2}$ holds in a neighborhood of the origin.
For some integer $d \in [1, n]$ , a d-dimensional observable function $ϕ \in C^{2}$ and the d-dimensional slow spectral subspace E of the hyperbolic fixed point $x = 0$ of system (21) satisfy the non-degeneracy condition
$\begin{matrix} rank [D, ϕ, (0), |_{E}] = d . \end{matrix}$ 24
The data matrices $Φ$ and $\hat{Φ}$ collected from iterations of system (21) are non-degenerate and are dominated by data near E, i.e.,
$\begin{matrix} {rank}_{row} Φ = d, \\ |P_{F}, X|, |P_{F}, \hat{X}| \leq {|P_{E}, X|}^{1 + β}, \end{matrix}$ 25
for some $β \in (0, 1)$ .

Then the DMD computed from $Φ$ and $\hat{Φ}$ yields a matrix $D$ that is locally topologically conjugate with order $O ({|P_{E}, X|}^{β})$ error to the linearized dynamics on a d-dimensional attracting spectral submanifold $W (E) \in C^{1}$ tangent to E at $x = 0$ . Specifically, we have

\begin{matrix} D = D ϕ (0) T_{E} Λ_{E} {(D, ϕ, (0), T_{E})}^{- 1} + O ({|P_{E}, X|}^{β}) . \end{matrix}

The spectral submanifold $W (E)$ and its reduced dynamics are of class $C^{1}$ at the origin, and at least Hölder continuous in a neighborhood of the origin.

Proof

The proof is identical to the proof of Theorem 1 but uses the discrete version of the linearization result by Hartman [23] for stable hyperbolic fixed points of maps.

Theorem 1 can be immediately applied to justify DMD as a linearization tool for period-one maps (or Poincaré maps) of time-periodic, non-autonomous dynamical systems near their periodic orbits. This requires the data matrices $Φ$ and $\hat{Φ}$ to contain trajectories of such a Poincaré map. Remark 8 on the treatment of slow spectral subspaces E containing possible instabilities also applies here under the modified assumption

\begin{matrix} |λ_{n}| \leq \dots \leq |λ_{d + 1}| < 1, \\ |λ_{d + 1}| < |λ_{d}| \leq \dots \leq |λ_{1}|, \\ |λ_{j}| \neq 1, j = 1, \dots, n, \end{matrix}

which only requires the fixed point to be hyperbolic and $A$ to have a d-dimensional normally attracting subspace.

Justification of DMD for infinite-dimensional dynamical systems

Most data sets of interest arguably arise from infinite-dimensional dynamical systems of fluids and solids. Examples include experimental or numerical data describing fluid motion, continuum vibrations, climate dynamics or salinity distribution in the ocean. In the absence of external forcing, these problems are governed by systems of autonomous nonlinear partial differential equations that can often be viewed as evolutionary differential equations in a form similar to Eq. (1), but defined on an appropriate infinite-dimensional Banach space. Accordingly, time-sampled solutions of these equations can be viewed as iterated mappings of the form (21) but defined on Banach spaces.

Our approach to justifying DMD generally carries over to this infinite-dimensional setting, as long as the observable vector $ϕ (x)$ remains finite-dimensional, and both the Banach space and the discrete or continuous dynamical system defined on it satisfy appropriate regularity conditions. These regularity conditions tend to be technical, but when they are satisfied, they do guarantee the extension of Theorems 1 and 2 to Banach spaces. This offers a justification to use DMD to obtain an approximate finite-dimensional linear model for the dynamics of the underlying continuum system on a finite-dimensional attracting slow manifold (or inertial manifold) in the neighborhood of a non-degenerate stationary solution.

To avoid major technicalities, we only state here a generalized version of Theorem 1 to justify the use of DMD for observables defined on Banach spaces for a discrete evolutionary process with a stable hyperbolic stationary state. We consider mappings of the form

\begin{matrix} x_{n + 1} = & f (x_{n}) = A x_{n} + \tilde{f} (x_{n}), x_{j} \in B, \\ \tilde{f} : U \subset B \to B, \tilde{f} (0) = 0 \in U, \end{matrix}

where $B$ is a Banach space, U is an open set in $B$ , and $A : B \to B$ is an invertible linear operator that is bounded in the norm defined on $B$ . The function $f$ can be here the time-sampled version of an infinite-dimensional flow map of an autonomous evolutionary PDE or the Poincaré map of a time-periodic evolutionary PDE. We assume that for some $α \in (0, 1)$ , $\tilde{f} \in C^{1, α} (U)$ holds, i.e., $\tilde{f}$ is (Fréchet-) differentiable in U and its derivative, $D \tilde{f}$ , is Hölder-continuous in $x \in U$ with Hölder exponent $α$ .

The spectral radius of A is defined as

\begin{matrix} ρ (A) = lim_{k \to \infty} {|A^{k}|}^{\frac{1}{k}} . \end{matrix}

We recall that in the special case $B = R^{n}$ treated in Sect. 3.2, we have $ρ (A) = {max}_{1 \leq j \leq n} |λ_{j}|$ . For some $α \in (0, 1)$ , the linear operator $A$ is called $α$ -contracting if

\begin{matrix} ρ {(A)}^{1 + α} ρ (A^{- 1}) < 1, \end{matrix}

which can only hold if $ρ (A) < 1$ (see [41]). Therefore, in the simple case of $B = R^{n}$ , A is $α$ -contracting if it is a contraction (i.e., all its eigenvalues are less than one in norm) and

\begin{matrix} {|λ_{1}|}^{1 + α} < |λ_{n}|, \end{matrix}

showing that the spectrum of $A$ is confined to an annulus of outer radius $|λ_{1}| < 1$ and inner radius ${|λ_{1}|}^{1 + α}$ . We can now state our main result on the justification of DMD for infinite-dimensional discrete dynamical systems.

Theorem 3

(Justification of DMD for infinite-dimensional maps with stable hyperbolic fixed points) Assume that

For some $α \in (0, 1)$ , the linear operator $A$ is $α$ -contracting (and hence the $x = 0$ fixed point of system (28) is linearly stable).
In Eq. (28), $\tilde{f} \in C^{1, α} (U)$ holds in a U neighborhood of the origin.
For some integer $d \in N^{+}$ , there is a splitting $B = E \oplus F$ of $B$ into two A-invariant subspaces $E, F \subset B$ such that E is d-dimensional and slow, i.e.,
$\begin{matrix} ρ ({A |}_{E}) < \frac{1}{ρ (A^{- 1}, |_{F})} \end{matrix}$ 30
Furthermore, a d-dimensional observable function $ϕ \in C^{2}$ satisfies the non-degeneracy condition
$\begin{matrix} rank [D, ϕ, (0), |_{E}] = d . \end{matrix}$ 31
The data matrices $Φ$ and $\hat{Φ}$ collected from iterations of system (28) are non-degenerate and are dominated by data near E, i.e.,
$\begin{matrix} {rank}_{row} Φ = d, \\ |P_{F}, X|, |P_{F}, \hat{X}| \leq {|P_{E}, X|}^{1 + β}, \end{matrix}$ 32
for some $β \in (0, 1)$ .

Then the DMD computed from $Φ$ and $\hat{Φ}$ yields a matrix $D$ that is locally topologically conjugate with order $O ({|P_{E}, X|}^{β})$ error to the linearized dynamics on a d-dimensional attracting spectral submanifold $W (E)$ tangent to E at $x = 0$ . Specifically, we have

\begin{matrix} D = D ϕ (0) T_{E} Λ_{E} {(D, ϕ, (0), T_{E})}^{- 1} + O ({|P_{E}, X|}^{β}) . \end{matrix}

The spectral submanifold $W (E)$ and its reduced dynamics are of class $C^{1}$ in a neighborhood of the origin.

Proof

The proof follows the steps in the proof of Theorem 2 but uses an infinite-dimensional linearization result, Theorem 3.1 of Newhouse [41], for stable hyperbolic fixed points of maps on Banach spaces. Specifically, if $A$ is $α$ -contracting, then Newhouse [41] shows the existence of a near-identity linearizing transformation $x = y + h (y)$ for the discrete dynamical system (28) such that $h \in C^{1, α} (B)$ holds on a small enough ball $B \subset U$ centered at $x = 0$ . Using this linearization theorem instead of its finite-dimensional version from Hartman [23], we can follow the same steps as in the proof of Theorem 2 to conclude the statement of the theorem. $□$

In Appendix B, Remarks 6 and 7 summarize technical remarks on possible further extensions of Theorem 3.

Data-driven linearization (DDL)

Theoretical foundation for DDL

Based on the results of the previous section, we now refine the first-order approximation to the linearized dynamics yielded by DMD near a hyperbolic fixed point. Specifically, we construct the specific nonlinear coordinate change that linearizes the restricted dynamics on the attracting spectral submanifold $W (E)$ illustrated in Fig. 1. This classic notion of linearization on $W (E)$ yields a d-dimensional linear reduced model, which can be of significantly lower dimension than the original n-dimensional nonlinear system. This is to be contrasted with the broadly pursued Koopman embedding approach (see, e.g., [8, 40, 49]), which seeks to immerse nonlinear systems into linear systems of dimensions substantially higher (or even infinite) relative to n.

The following result gives the theoretical basis for our subsequent data-driven linearization (DDL) algorithm. We will use the notation $C^{a}$ for the class of real analytic functions. We also use the notation $⌊x⌋$ to denote the integer part of x.

Theorem 4

(DDL principle for ODEs with a stable hyperbolic fixed points) Assume that the origin, $x = 0$ is a stable hyperbolic fixed point of system (11) and the spectrum of $A$ has a spectral gap as in Eq. (12). Assume further that for some $r \in N^{+} \cup \{\infty, a\}$ , the following conditions are satisfied:

$f_{2} \in C^{r}$ and the nonresonance conditions
$\begin{matrix} λ_{k} \neq & \sum_{j = 1}^{n} m_{j} λ_{j}, m_{j} \in N, k = 1, . . ., n, \\ 2 \leq & \sum_{j = 1}^{n} m_{j} \leq Q \leq r, \\ Q : = & ⌊\frac{{max}_{i} |Re, λ_{i}|}{{min}_{i} |Re, λ_{i}|}⌋ + 1, \end{matrix}$ 34
hold for the eigenvalues of $A$ .
For some integer $d \in [1, n]$ , a d-dimensional observable function $ϕ \in C^{r}$ and the d-dimensional slow spectral subspace E of the stable fixed point $x = 0$ of system (11) satisfy the non-degeneracy condition.
$\begin{matrix} rank [D, ϕ, (0), |_{E}] = d . \end{matrix}$ 35

Then the following hold:

(i)
On the unique d-dimensional attracting spectral submanifold $W (E) \in C^{r}$ tangent to E at $x = 0$ , the reduced observable vector ${φ = ϕ |}_{W (E)}$ can be used to describe the reduced dynamics as
$\begin{matrix} \dot{φ} = & B φ + q (φ), \\ B = & D ϕ (0) T_{E} Λ_{E} {(D, ϕ, (0), T_{E})}^{- 1}, \\ q (φ) = & O ({|φ|}^{2}) . \end{matrix}$ 36
(ii)
There exists a unique, $C^{r}$ change of coordinates
$\begin{matrix} φ = κ (γ) = γ + ℓ (γ), \end{matrix}$ 37
that transforms the reduced dynamics on $W (E)$ to its linearization
$\begin{matrix} \dot{γ} = B γ \end{matrix}$ 38
inside the domain of attraction of $x = 0$ within the spectral submanifold $W (E) .$
(iii)
The transformation (37) satisfies the d-dimensional system of nonlinear PDEs
$\begin{matrix} D_{γ} ℓ (γ) B γ = B ℓ (γ) + q (γ + ℓ (γ)) . \end{matrix}$ 39
If $r \in N^{+} \cup \{\infty\}$ , solutions of this PDE can locally be approximated as
$\begin{matrix} ℓ (γ) = & \sum_{|k| = 2}^{r} l_{k} γ^{k} + o ({|γ|}^{r}), k \in N^{d}, l_{k} \in R^{d}, \\ γ^{k} : = & γ_{1}^{1} \dots γ_{d}^{d} . \end{matrix}$ 40
If $r = a$ , then the local approximation (40) can be refined to a convergent Taylor series
$\begin{matrix} ℓ (γ) = & \sum_{|k| = 2}^{\infty} l_{k} γ^{k}, k \in N^{d}, l_{k} \in R^{d}, \\ γ^{k} : = & γ_{1}^{1} \dots γ_{d}^{d} \end{matrix}$ 41
in a neighborhood of the origin. In either case, the coefficients $l_{k}$ can be determined by substituting the expansion for $ℓ (γ)$ into the PDE (39), equating coefficients of equal monomials $γ^{k}$ and solving the corresponding recursive sequence of d-dimensional linear algebraic equations for increasing $|k|$ .

Proof

The proof builds on the existence of the d-dimensional spectral submanifold $W (E)$ guaranteed by Theorem 1. For a $C^{r}$ dynamical system with $r \in N^{+} \cup \{\infty, a\}$ , $W (E)$ is also $C^{r}$ smooth based on the linearization theorems of Poincaré [46] and Sternberg [52], as long as the nonresonance condition (34) holds. Condition (35) then ensures that $W (E)$ can be parametrized locally by the restricted observable vector $φ$ and hence its reduced dynamics can be written as a nonlinear ODE for $φ$ . This ODE can again be linearized by a near-identity coordinate change (37) using the appropriate linearization theorem of the two cited above. The result is the restricted linear system (38) to which the dynamics is $C^{r}$ conjugate within the whole domain of attraction of the $φ = 0$ fixed point inside $W (E)$ . The invariance PDE (39) can be obtained by substituting the linearizing transformation (37) into the reduced dynamics on $W (E)$ . This PDE can then be solved via a Taylor expansion up to order r. We give more a more detailed proof in Appendix D. $□$

Note that 1:1 resonances are not excluded by the condition (34), and hence repeated eigenvalues arising from symmetries in physical systems are still amenable to DDL. Also of note is that the non-resonance conditions (34) do not exclude frequency-type resonances among imaginary parts of oscillatory eigenvalues. Rather, they exclude simultaneous resonances of the same type between the real and the imaginary parts of the eigenvalues. Such resonances will be absent in data generated by generic oscillatory systems.

Assuming hyperbolicity is essential for Theorem 4 to hold, since in this case the linearization is the same as transforming the dynamics to the Poincaré-normal form. For a non-hyperbolic fixed point, this normal form transformation results in nonlinear dynamics on the center manifold. This would, however, only arise in highly non-generic systems, precisely tuned to be at criticality. Since this is unlikely to happen in experimentally observed or numerically simulated systems, the hyperbolicity assumption is not restrictive.

Finally, under the conditions of Theorem 3, the DDL results of Theorem 4 also apply to data from infinite-dimensional dynamical systems, such as the fluid sloshing experiments we will analyze using DDL in Sect. 5.4. In practice, the most restrictive condition of Theorem 3 is (A1), which requires the solution operator to have a spectrum uniformly bounded away from zero. Such uniform boundedness is formally violated in important classes of infinite-dimensional evolution equations, presenting a technical challenge for the direct applications of SSM results to certain delay-differential equations (see [54]) and partial differential equations (see, e.g., [9, 30]). However, this challenge only concerns rigorous conclusions on the existence and smoothness of a finite-dimensional, attracting SSM. If the existence of such an SSM is convincingly established from an alternative mathematical theory (as is [9]) or inferred from data (as in [54]), then the DDL algorithm based on Theorem 4 can be used to obtain a data-driven linearization of the dynamics on that SSM.

DDL versus EDMD

Here we examine whether there is a possible relationship between DDL and the extended DMD (or EDMD) algorithm of Williams et al. [58]. For simplicity, we assume analyticity for the dynamical system ( $r = a$ ) and hence we can write the inverse of the linearizing transformation (40) behind the DDL algorithm as a convergent Taylor expansion of the form

\begin{matrix} γ = κ^{- 1} (φ) = φ + \sum_{|k| = 2}^{\infty} q_{k} φ^{k} . \end{matrix}

We then differentiate this equation in time to obtain from the linearized equation (38) a d-dimensional system of equations

\begin{matrix} \dot{φ} + \sum_{|k| = 2}^{\infty} q_{k} \frac{d}{dt} φ^{k} = B φ + \sum_{|k| = 2}^{\infty} B q_{k} φ^{k} \end{matrix}

that the restricted observable $φ$ and its monomials $φ^{k}$ must satisfy. This last equation can be rewritten as a d-dimensional autonomous system of linear system of ODEs,

\begin{matrix} [\begin{matrix} I_{d \times d} & Q_{2} \end{matrix}] \frac{d}{dt} [\begin{matrix} φ \\ K_{\geq 2} (φ) \end{matrix}] = [\begin{matrix} B & B Q_{2} \end{matrix}] [\begin{matrix} φ \\ K_{\geq 2} (φ) \end{matrix}], \end{matrix}

for the reduced observable $φ$ and the infinite-dimensional vector $K_{\geq 2} (φ)$ of all nonlinear monomials of $φ$ . Here $I_{d \times d}$ denotes the d-dimensional identity matrix and $Q_{2}$ contains all coefficients $q_{k}$ as column vectors starting from order $|k| = 2$ .

If we truncate the infinite-dimensional vector of monomials $K_{\geq 2} (φ)$ to the vector $K_{2}^{k} (φ)$ of nonlinear monomials up to order k, then Eq. (43) becomes

\begin{matrix} [\begin{matrix} I_{d \times d} & Q_{2}^{k} \end{matrix}] \frac{d}{dt} [\begin{matrix} φ \\ K_{2}^{k} (φ) \end{matrix}] = [\begin{matrix} B & B Q_{2}^{k} \end{matrix}] [\begin{matrix} φ \\ K_{2}^{k} (φ) \end{matrix}] . \end{matrix}

This is a d-dimensional implicit system of linear ODEs for the dependent variable vector $(φ, K_{2}^{k} (φ))$ whose dimension is always larger than d. Consequently, the operator $[\begin{matrix} I_{d \times d} & Q_{2}^{k} \end{matrix}]$ is never invertible and hence, contrary to the assumption of EDMD, there is no well-defined linear system of ODEs that governs the evolution of an observable vector and the monomials of its components.

The above conclusion remains unchanged even if one attempts to optimize with respect to the choice of the coefficients $q_{k}$ in the matrix $Q_{2}^{k}$ .

Implementation and applications of DDL

Basic implementation of DDL for model reduction and linearization

Theorem 4 allows us to define a numerical procedure to construct a linearizing transformation on the d-dimensional attracting slow manifold $W (E)$ systematically from data. From Eq. (44), the matrices $B$ and $Q$ are to be determined, given a set of observed trajectories. In line with the notation used in Sect. 4.2, let the data matrix $K_{2}^{k} (φ)$ contain monomials (from order 2 to order k) of the observable vector $φ$ and let $\hat{K_{2}^{k}} (φ)$ contain denote the evaluation of $K_{2}^{k} (φ)$ time $Δ t$ later. Passing to the discrete version of the invariance equation (44), we obtain

\begin{matrix} [\begin{matrix} I & Q \end{matrix}] [\begin{matrix} \hat{φ} \\ \hat{K_{2}^{k}} (φ) \end{matrix}] = [\begin{matrix} B & B Q \end{matrix}] [\begin{matrix} φ \\ K_{2}^{k} (φ) \end{matrix}] \end{matrix}

for some matrices $Q \in R^{d \times N (d, k) - d}$ and $B = e^{B Δ t} \in R^{d \times d}$ . Moreover, the inverse transformation of the linearization on the SSM $W (E)$ is well-defined, and hence with an appropriate matrix $Q^{inv} \in R^{d \times N (d, k) - d}$ , we can write

\begin{matrix} [\begin{matrix} I & Q^{inv} \end{matrix}] [\begin{matrix} φ + Q K_{2}^{k} (φ) \\ K_{2}^{k} (φ + Q K_{2}^{k} (φ)) \end{matrix}] = φ . \end{matrix}

This allows us to define the cost functions

\begin{matrix} L^{(1)} (Q, B) & = |[\begin{matrix} I & Q \end{matrix}], [\begin{matrix} \hat{φ} \\ \hat{K_{2}^{k}} (φ) \end{matrix}]) \\ {(- [\begin{matrix} B & B Q \end{matrix}] [\begin{matrix} φ \\ K_{2}^{k} (φ) \end{matrix}]|}^{2}, \\ L^{(2)} (Q, Q^{inv}) & = |Q, K_{2}^{k}, (φ)) \\ {(+ Q^{inv} K_{2}^{k} (φ + Q K_{2}^{k} (φ))|}^{2}, \end{matrix}

where $L^{(1)}$ measures the invariance error along the observed trajectories and $L^{(2)}$ measures the error due to the computation of the inverse. We aim to jointly minimize $L^{(1)}$ and $L^{(2)}$ . To this end, we define the combined cost function

\begin{matrix} L_{ν} (Q, Q^{inv}, B) = & L^{(1)} (Q, B) \\ + ν L^{(2)} (Q, Q^{inv}), \end{matrix}

for some $ν \geq 0$ . In our examples, we choose $ν = 1$ , which puts the same weight on both terms in the cost function (46). Minimizers of $L_{ν}$ provide optimal solutions to the DDL principle and can be written as

\begin{matrix} (Q^{⋆}, Q^{i n v, ⋆} B^{⋆}) = \underset{Q, Q^{inv}, B}{argmin} L_{ν} (Q, Q^{inv}, B), \end{matrix}

or, equivalently, as solutions of the system of equations

\begin{matrix} \frac{\partial L_{ν}}{\partial Q_{ij}} & = 0 i = 1, . . ., d, j = 1, . . ., N (d, k) - d, \end{matrix}

\begin{matrix} \frac{\partial L_{ν}}{\partial Q_{ij}^{inv}} & = 0, i = 1, . . ., d, j = 1, . . ., N (d, k) - d, \\ \frac{\partial L_{ν}}{\partial B_{ij}} & = 0 i, j = 1, . . ., d . \end{matrix}

The optimal solution (47) does not necessarily coincide with the Taylor-coefficients of the linearizing transformation (41). Instead of giving the best local approximation, $(Q^{⋆}, Q^{i n v, ⋆} B^{⋆})$ approximates the linearizing transformation and the linear dynamics in a least-squares sense over the domain of the training data. This means that DDL is not hindered by the convergence properties of the analytic linearization. Note that for $d = 1$ , one can estimate the radius of convergence of (41), for example, by constructing the Domb–Skyes plot (see [15]), or by finding the radius of the circle in the complex plane onto which the roots of the truncated expansion accumulate under increasing orders of truncation (see [25, 47]). For $d > 1$ , such analysis is more difficult, since multivariate Taylor-series have more complicated domains of convergence. In our numerical examples, we estimate the domain of convergence of such analytic linearizations as the domain on which $κ \circ κ^{- 1} = I$ holds to a good approximation. As we will see, this domain of convergence may be substantially smaller the domain of validity of transformations determined in a fully data-driven way.

Since the cost function (45) is not convex, the optimization problem (47) has to be solved iteratively starting from an initial guess $(Q_{0}, Q_{0}^{inv}, B_{0})$ . For the examples presented in the paper, we use the Levenberg–Marquardt algorithm (see [4]), but other nonlinear optimization methods, such as gradient descent or Adam (see [29]) could also be used. For our implementation, which is available from the repository [28], we used the Scipy and Pytorch libraries of Virtanen et al. [57], Paszke et al. [45]. In summary, we will use the following Algorithm 1 in our examples for model reduction via DDL.

Remark 3

The expressions (45)–(46) define one of the possible choices for the cost function. With $ν = 0$ , (46) simply corresponds to a one-step-ahead prediction with the linearized dynamics. Alternatively, a multi-step prediction can also be enforced. For a training trajectory $φ (t),$ the invariance

\begin{matrix} [\begin{matrix} I & Q \end{matrix}] [\begin{matrix} φ \\ K_{2}^{k} (φ) \end{matrix}] = [\begin{matrix} B^{1 : m} & B^{1 : m} Q \end{matrix}] [\begin{matrix} φ (0) \\ K_{2}^{k} (φ (0)) \end{matrix}] \end{matrix}

could be required, where $B^{1 : m}$ is a tensor composed of powers of the linear map $B$ . Optimizing over the entire trajectory is, however, more costly than simply minimizing (45), and we found no noticeable improvement in accuracy in our numerical examples.

Relationship with DMD implementations

Note that setting $Q = Q^{inv} = 0$ in the optimization problem (47) turns the problem into DMD. In this case, the usual DMD algorithm surveyed in the Introduction returns

\begin{matrix} B_{0} = \underset{B}{argmin} L_{0} (0, 0, B), \end{matrix}

which is a good initial guess for the non-convex optimization problem (47). More importantly, since Theorem 4 guarantees the existence of a near-identity linearizing transformation, we expect that the true minimizer is close to the DMD-solution. Therefore, we may explicitly expand the cost function (45) around the DMD solution as

\begin{matrix} L_{ν} (Q, Q^{inv}, B) = L_{ν} (0, 0, B_{0}) \\ + D L_{(0, 0, B_{0})} \cdot (\begin{matrix} Q \\ Q^{inv} \\ B - B_{0} \end{matrix}) \\ + \frac{1}{2} [D^{2} L_{(0, 0, B_{0})} \cdot (\begin{matrix} Q \\ Q^{inv} \\ B - B_{0} \end{matrix})] \cdot (\begin{matrix} Q \\ Q^{inv} \\ B - B_{0} \end{matrix}) \\ + ({|Q|}^{3}, {|Q^{inv}|}^{3}, {|B - B_{0}|}^{3}), \end{matrix}

where $D L_{(0, 0, B_{0})}$ and $D^{2} L_{(0, 0, B_{0})}$ are the Jacobian and the Hessian of the cost function evaluated at the DMD solution, respectively. Since the Jacobian is nonsingular at the DMD solution, the minimum of the quadratic approximation of the cost function satisfies the linear equation

\begin{matrix} - D L_{(0, 0, B_{0})} = D^{2} L_{(0, 0, B_{0})} (\begin{matrix} Q \\ Q^{inv} \\ B - B_{0} \end{matrix}) . \end{matrix}

This serves as the first-order correction to the DMD-solution in the DDL procedure. The Eq. (50) is explicitly solvable and is equivalent to performing a single Levenberg–Marquardt step on the non-convex cost function (45), with the DMD solution $(0, 0, B_{0})$ serving as an initial guess.

Minimization of (45) leads to a non-convex optimization problem. Besides computing the leading-order approximation (50), a possible workaround to this challenge is to carry out the linearization in two steps. First, one can fit a polynomial map to the reduced dynamics by linear regression. Then, if the reduced-dynamics is non-resonant, it can be analytically linearized. Axås et al. [3] follow this approach to automatically find the extended normal form style reduced dynamics on SSMs using the implementation of SSM Tool by Jain et al. [24]. Although this procedure does convert the DDL principle into a convex problem, the drawback is that the linearization is obtained as a Taylor-expansion, with possibly limited convergence properties.

Using DDL to construct spectral foliations

The mathematical foundation of SSM-reduced modeling is that any trajectory converging to a slow SSM is guaranteed to synchronize up to an exponentially decaying error with one of the trajectories on the SSM. This follows from the general theory of invariant foliations by Fenichel [18], when applied to the d-dimensional normally hyperbolic invariant manifold $W (E)$ .2 The main result of the theory is that off-SSM initial conditions synchronizing with the same on-SSM trajectory turn out to form a class $C^{r - 1}$ smooth, $(n - d)$ -dimensional manifold, denoted $F_{p}$ , which intersects $W (E)$ in a unique point $p \in W (E)$ . The manifold $F_{p}$ is called the stable fiber emanating from the base point $p$ . Fenichel proves that any off-SSM trajectory $x (t ; x_{0})$ with initial condition $x_{0} \in F_{p_{0}}$ converges to the specific on-SSM trajectory $p (t ; p_{0}) \in W (E)$ with initial condition $p_{0} \in W (E)$ faster than any other nearby trajectory might converge to $p (t ; p_{0}) .$ Recently, Szalai [55] studied this foliation in more detail under the name “invariant spectral foliation”, discussed its uniqueness in an appropriate smoothness class and proposed its use in model reduction.

To predict the evolution of a specific, off-SSM initial condition $x_{0}$ up to time t from an SSM-based model, we first need to relate that initial condition to the base point $p_{0}$ of the stable fiber $F_{p_{0}}$ . Next, we need to run the SSM-based reduced model up to time t to obtain $p (t ; p_{0})$ . Based on the exponentially fast convergence of the full solution $x (t ; x_{0})$ to the SSM-reduced solution $p (t ; p_{0}),$ we obtain an accurate longer-term prediction for $x (t ; x_{0})$ using this procedure. Such a longer-term prediction is helpful, for instance, when we wish to predict steady states, such as fixed points and limit cycles, from the SSM-reduced dynamics.

Constructing this spectral foliation directly from data, however, is challenging for nonlinear systems. Indeed, one would need a very large number of initial conditions that cover uniformly a whole open neighborhood of the fixed point in the phase space. For example, while one or two training trajectories are generally sufficient to infer accurate SSM-reduced models even for very high-dimensional systems (see e.g., [3, 12, 13]), thousands of uniformly distributed initial conditions in a whole open set of a fixed point are required to infer accurate spectral foliation-based models even for low-dimensional systems (see [55]). The latter number and distribution of initial conditions is unrealistic to acquire in a truly data-driven setting.

To avoid constructing the full foliation, one may simply project an initial condition $x_{0}$ orthogonally to an observed spectral submanifold $W (E)$ to obtain $p_{0}$ , but this may result in large errors if E and F are not orthogonal. In that case, $W (E)$ may divert substantially from E (see [42, 43, 48] for a discussion of the limitations of this projection for general invariant manifolds).

A better solution is to project $x_{0}$ orthogonally to the slow spectral subspace E over which $W (E)$ is a graph in an (often large) neighborhood of the fixed point. This approach assumes that E and F are nearly normal and $W (E)$ is nearly flat. As the latter is typically the case for delay-embedded observables [2], orthogonal projection onto E has been the choice so far in data-driven SSM-based reduction via the SSMLearn algorithm [11]. This approach has produced highly accurate reduced-order models in a number of examples (see [3, 12, 13]). There are nevertheless examples in which the linear part of the dynamical system is significantly non-normal and hence E and F are not close to being orthogonal (see [6]).

Near hyperbolic fixed points, the use of DDL eliminates the need to construct involved nonlinear spectral foliations. Indeed, let us assume that the slow spectral subspace E in Theorem 4 can be decomposed into a direct sum $E = E_{1} \oplus E_{2}$ , where $E_{1}$ denotes the slowest spectral subspace with dim $E_{1} = d_{1}$ and $E_{2}$ denotes the second-slowest spectral subspace with dim $E_{2} = d_{2}$ , as sketched in Fig. 2. Reducing the dynamics to the SSM $W (E)$ is accurate for transient times given by the decay rate of $E_{2} .$ This initial reduction can be done simply by a normal projection onto E. Inside E, one can simply locate spectral foliations of the DDL-linearized systems explicitly and map them back to the original nonlinear system under the DDL transformation $κ (γ)$ . The unique class $C^{a}$ foliation of a linear system within E is the family of stable fibers forming the affine space

\begin{matrix} F_{p} = p + E_{2}, \end{matrix}

where $p \in E_{1}$ . The trajectories started inside $F_{p}$ all synchronize with $p \in E_{1}$ . The linear projection $P_{E_{2}}$ onto $E_{1}$ along directions parallel to $E_{2}$ , when applied to an initial condition $y \in F_{p}$ , returns the base point

\begin{matrix} P_{E_{2}} y = p . \end{matrix}

In the nonlinear system (1), the leaves of the smooth foliation within $W (E)$

\begin{matrix} F_{κ (p)}^{0} = κ (F_{p}) \subset W (E), \end{matrix}

where $κ (p) \in W (E_{1})$ is the image of $p$ under the mapping $κ$ defined in (37). The SSM $W (E)$ can then be parametrized via the foliation

\begin{matrix} W (E) = ⋃_{q \in W (E_{1})} F_{q}^{0} . \end{matrix}

Fig. 2 — a The linearized phase space geometry governed by the slow spectral subspace $E = E_{1} \oplus E_{2}$ and the slow invariant foliation within E. b) Phase space geometry in the original coordinates

Using DDL to predict nonlinear forced response from unforced data

We now discuss how DDL performed near the fixed point of an autonomous dynamical system can be used to predict nonlinear forced response under additional weak periodic forcing in the domain of DDL. The addition of such small forcing is frequent in structural vibration problems in which the unforced structure (e.g., a beam or disk) is rigid enough to react with small displacements under practically relevant excitation levels (see, e.g., [12, 13] for specific examples).

We append system (11) with a small, time-periodic forcing term $ε F (x, t)$ to obtain the system

\begin{matrix} \dot{x} = & A x + \tilde{f} (x) + ε F (x, t), x \in R^{n}, \\ A = & D f (0), \tilde{f} (x) = O ({|x|}^{2}), 0 \leq ϵ ≪ 1, \end{matrix}

with $F (x, t) = F (x, t + T)$ for some period $T > 0$ . If the conditions of Theorem 4 hold for the system (52) for $ε = 0$ , then, for $ϵ > 0$ small enough, exists a unique d-dimensional, T-periodic, attracting spectral submanifold $W_{ε} (E, t) \in C^{r}$ of a locally unique attracting T-periodic orbit $x_{ϵ} (t)$ perturbing from $x = 0$ (see, e.g., [10, 22]). The manifold $W_{ε} (E, t)$ is $O (ε)$ $C^{1}$ -close to $W_{0} (E, t) \equiv W (E)$ and hence its reduced dynamics can be parametrized using the reduced observable vector ${φ = ϕ |}_{W (E)}$ in the form

\begin{matrix} \dot{φ} = B φ + q (φ) + ε \hat{F} (φ, t), \\ B = D ϕ (0) T_{E} Λ_{E} {(D, ϕ, (0), T_{E})}^{- 1}, \\ q (φ) = O ({|φ|}^{2}), \\ \hat{F} (φ, t) = {(D, ϕ, (0), T_{E})}^{- 1} {(I + D h (φ, 0))}^{- 1} F (0, t) \\ + O (ε, {|φ|}^{2}), \end{matrix}

where we have relegated the details of this calculation to Appendix E.

Then the unique, $C^{r}$ change of coordinates,

\begin{matrix} φ = κ (γ) = γ + ℓ (γ), \end{matrix}

guaranteed by statement (iii) of Theorem 4 transforms the reduced dynamics (53) to its final form

\begin{matrix} \dot{γ} = B γ + ε {(I + D ℓ (γ))}^{- 1} \hat{F} (0, t) . \end{matrix}

The transformation is valid on trajectories of (52) as long as they remain in the domain of definition of the coordinate change (54).

Note that Eq. (55) is a weakly perturbed, time-periodic nonlinear system. The matrix $B$ and the nonlinear terms $ℓ (γ)$ can be determined using data from the unforced ( $ε = 0$ ) system. As a result, nonlinear time-periodic forced response can be predicted solely from unforced data by applying numerical continuation to system (55) for $ε > 0$ . This is not expected to be as accurate as SSM-based forced response prediction (see, e.g., [2, 3, 12, 13]), but nevertheless offers a way to make predictions for non-linearizable forced response based solely on DDL performed on unforced data. These predictions are valid for forced trajectories that stay in the domain of convergence of DDL carried out on the unforced system. We will illustrate such predictions using actual experimental data from fluid sloshing in Sect. 5.4.

Setting $ℓ (γ) = 0$ in formula (55) enables us to carry out a forced-response prediction based on DMD as well. Such a prediction will be fundamentally linear with respect to the forcing and can only be reasonably accurate for very small forcing amplitudes, as we will indeed see in examples. There is no systematic way to model the addition of non-autonomous forcing in the EDMD procedure, and hence EDMD will not be included in our forced response prediction comparisons.

We also note, that one might be tempted to solve an approximate version of (55) by assuming

\begin{matrix} ε {(I + D ℓ (γ))}^{- 1} \approx ε I . \end{matrix}

This assumption simplifies the computation of the forced response of the nonlinear system (55) to those of a simple linear system. Although the forced response computed using this approximate DDL method turns out to be more accurate than DMD on our example, we do not recommend this approach. This is because neglecting the nonlinear effects of the coordinate change in (55) is, in general, inconsistent with $ℓ (γ) \neq 0$ . We give more detail on this approximation in Appendix F of the Supplementary Information.

Examples

In this section, we compare the DMD, EDMD and DDL algorithms on specific examples. When applicable, we also compute the exact analytic linearization of the dynamical system near its fixed point as a benchmark. On a slow SSM $W (E)$ , an observer trajectory $φ (t)$ , starting from a select initial condition $φ (0)$ , will be tracked as the image of the linearized reduced observer trajectory $γ (t)$ under the linearizing transformation (54):

\begin{matrix} φ (t) = & κ (e^{B t}, γ, (0)) = e^{B t} γ (0) + ℓ (e^{B t}, γ, (0)), \\ γ (0) = & κ^{- 1} (φ (0)) . \end{matrix}

When model reduction has also taken place, i.e., when the observable vector $φ$ is not defined on the full phase space, we will nevertheless provide a prediction in the full phase space via the parametrization of the slow SSM.

By Theorem 1, DMD can be interpreted as setting $ℓ (γ) \equiv 0$ in (57) and finding the linear operator $B$ as a best fit from the available data. In contrast, DDL finds the linear operator $B$ , the transformation $φ = γ + ℓ (γ)$ , and its inverse simultaneously. As we explained in Sect. 4.2, EDMD cannot quite be interpreted in terms of the linearizing transformation (57) as it is an attempt to immerse the dynamics into a higher dimensional space. For our EDMD tests, we will use monomials of the observable vector $φ$ .

1D nonlinear system with two isolated fixed points

Consider the one-dimensional ODE obtained as the radial component of the Stuart–Landau equation, i.e.,

\begin{matrix} \dot{r} = μ r - r^{3}, \end{matrix}

which can be rescaled to

\begin{matrix} \dot{R} = R - R^{3} . \end{matrix}

For $R \geq 0$ , the system has a repelling fixed point at $R = 0$ and an attracting one at $R = 1$ . Page and Kerswell [44] show that local expansion of observables in terms of the Koopman eigenfunctions computed near each fixed point are possible, but the expansions at the two fixed points are not compatible with each other and both diverge at $R = \sqrt{2} / 2 \approx 0.7071$ . This is a consequence of the more general result that the Koopman eigenfunctions themselves inevitably blow up near basin boundaries (see our Proposition 1 in Appendix A of the Supplementary Information). Both DMD and EDMD can nevertheless be computed from data, even for a trajectory crossing the turning point at $R = \sqrt{2} / 2$ , but the resulting models cannot have any connection to the Koopman operator.

In each comparison performed on system (58), we generate a single trajectory in the domain of attraction of the $R = 1$ fixed point and use it as training data for DMD, EDMD and DDL. In each subplot of Fig. 3, the single training trajectory starts from the intersection of the red horizontal line “IC of training trajectory” with the $t = 0$ dashed line. We then also generate a new test trajectory (black) with its initial condition denoted with a black dot over the line $t = 0$ . We place this initial condition slightly outside the domain of linearization for system (58) (under the grey line labeled “Turning point”). We use DMD, order- $k = 5$ EDMD, and DDL trained on a single training trajectory to make predictions for the black testing trajectory (not used in the training).

Figure 3a shows DDL to be the most accurate of the three methods when applied to forward-time ( $t \geq 0$ ) segments of the test trajectory. If we try to predict the backward-time ( $t < 0$ ) segment of the same trajectory as it leaves the training domain, DDL diverges immediately upwards, whereas DMD and EDMD diverge more gradually downwards. As we increase the training domain in Fig 3b, DDL continues to be the most accurate in both forward and backward time until it reaches the domain of its training range in backward time. At that point, it diverges quickly upwards, while DMD and EDMD diverge more slowly downwards.

Importantly, increasing the approximation order for DDL first to $k = 10$ then to $k = 18$ (see Fig. 3c, d), makes DDL predictions more and more accurate in backward time inside the training domain. At the same time, the same increase in order makes EDMD less and less accurate inside the same domain. This is not surprising for EDMD because it seeks to approximate the dynamics within a Koopman-invariant subspaces for increasing k, and Koopman mode expansions blow up at the “Turning point line”, as shown both analytically and numerically by Page and Kerswell [44]. Interestingly, however, EDMD becomes less accurate even within the domain of linearization under increasing k. This is clearly visible in Fig. 3d which shows spurious, growing oscillations in the EDMD predictions close to the $R = 1$ fixed point.

In summary, of the three methods tested, DDL makes the most accurate predictions in forward time. This remains true in backward time as longs as the trajectory remains in the training range used for the three methods, even if this range is larger than the theoretical domain of linearization. Inside the training range, an increase of the order k of the monomials used increases the accuracy of DDL but introduces growing errors in EDMD.

3D linear system studied via nonlinear observables

Wu et al. [60] studied the ability of DMD to recover a 3D linear system based on the time history of three nonlinear observables evaluated on the trajectories of the system. To define the linear system, they use a block-diagonal matrix $Λ$ and a basis transformation matrix $R$ of the form

\begin{matrix} Λ = & (\begin{matrix} a & - b & 0 \\ b & a & 0 \\ 0 & 0 & c \end{matrix}), a, b, c \in R, \\ R = & (\begin{matrix} 1 & 0 & sin θ_{1} cos θ_{2} \\ 0 & 1 & sin θ_{1} sin θ_{2} \\ 0 & 0 & cos θ_{2} \end{matrix}), \end{matrix}

to define the linear discrete dynamical system

\begin{matrix} x (n + 1) = (R, Λ, R^{- 1}) x (n) . \end{matrix}

The linear change of coordinates $R$ rotates the real eigenspace of $Λ$ corresponding to the eigenvalue c and hence introduces non-normality in system (60). This system is then assumed to be observed via a 3D nonlinear observable vector

\begin{matrix} y (x) = (\begin{matrix} x_{1} + 0.1 (x_{1}^{2} + x_{2} x_{3}) \\ x_{2} + 0.1 (x_{2}^{2} + x_{1} x_{3}) \\ x_{3} + 0.1 (x_{3}^{2} + x_{1} x_{2}) \end{matrix}) . \end{matrix}

Ideally, DMD should closely approximate the linear dynamics of system (60) because the observable function defined in Eq. (61) is close to the identity and has only weak nonlinearities. Wu et al. [60] find, however, that this system poses a challenge for DMD, which produced inaccurate predictions for the spectrum of $R Λ R^{- 1}$ .

Following one of the parameter settings of Wu et al. [60], we set $a = 0.45 \sqrt{3}$ , $b = 0.5$ , $c = 0.6$ , $θ_{1} = 1.5$ , and $θ_{2} = 0$ . We initialize three training trajectories with $∥x (0)∥ < 1,$ each containing 100 iterations of system (60). We then compute the predictions of a $5^{th}$ order DDL model and compare to those of DMD and EDMD on a separate test trajectory not used in training these three methods. The predictions and the spectrum obtained from the three methods are shown in Fig. 4.

Fig. 4 — Predictions by DMD, EDMD, and DDL on the discrete dynamical system (59) and (61). a Predicted and true $y_{1} -$ components of a test trajectory. b Spectra identified by DMD, EDMD, and DDL superimposed on the true spectrum (marked by crosses). The dashed line represents the unit circle. (Color figure online)

The predictions of DMD and EDMD can only be considered accurate for very low amplitude oscillations, while DDL returns accurate predictions throughout the whole trajectory. This example consists of linear dynamics and monomial observables of the state, and hence should be an ideal test case for EDMD. Yet, EDMD is inaccurate in identifying the spectrum of system (60). Indeed, as seen in Fig. 4b, a number of spurious eigenvalues arise from EDMD, both real and complex. DMD performs clearly better but it is still markedly less accurate than DDL. These inaccuracies in the predictions of EDMD and DMD spectra are also reflected by considerable errors in their predictions for trajectories, as seen in Fig. 4a. In contrast, DDL produces the most accurate prediction for the test trajectory.

Damped and periodically forced Duffing equation

We consider the damped and forced Duffing equation

\begin{matrix} \dot{x} & = y, \\ \dot{y} & = x - x^{3} - d y + ε cos Ω t, \end{matrix}

with damping coefficient $d = 0.0141$ , forcing frequency $Ω$ and forcing amplitude $ε$ . We perform a change of coordinates $(x, y) \mapsto φ = (φ_{1}, φ_{2})$ that moves the stable focus at $(x, y) = (1, 0)$ to the origin and makes the linear part block-diagonal. The resulting system is of the form

\begin{matrix} \dot{φ} = A φ + f (φ) + ε \hat{F} (t), f (φ) = O ({|φ|}^{2}), \end{matrix}

where

\begin{matrix} A = (\begin{matrix} - α & - ω \\ ω & - α \end{matrix}), ω = 1.4142, α = 0.00707, \end{matrix}

and $\hat{F} (t)$ is the transformed image of the physical forcing vector in (62). We first consider the unforced system with $ε = 0$ . In this case, the 2D slow SSM of the fixed point coincides with the phase space $R^{2}$ and hence no further model reduction is possible. However, since the non-resonance conditions (34) hold for the linear part (64), the system is analytically linearizable near the origin. The linearizing transformation and its inverse can both be computed from Eq. (63), as outlined in Eq. (41). For reference, we carry out this linearization analytically up to order $k = 9$ . The Taylor series of the linearization is estimated to converge for $|φ| < R_{crit} \approx 0.15$ . The details of the calculation can be found in the repository [28].

We now compare the analytic linearization results it to DMD, EDMD and DDL, with all three trained on the same three trajectories, launched both inside and outside the domain of convergence of the analytic linearization. The polynomial order of approximation is $k = 5$ for both the EDMD and the DDL algorithms. The performance of the various methods is compared in Fig. 5. Close to the fixed point, in the domain of convergence of the analytic linearization, all three methods perform well. Moving away from the fixed point, the analytic linearization is no longer possible. Both DMD and EDMD perform worse, while DDL continues to accurately linearize the system even outside the domain of convergence of the analytic linearization.

Fig. 5 — Comparison of the time evolution of the linearized trajectories (red) and the full trajectories of the nonlinear system (63) (blue), (63). a–d Analytic linearization, DMD, EDMD and DDL models trained and evaluated on trajectories inside the domain of convergence. e–h Same as (a)–(d) but outside the domain of convergence of the analytic linearization. (Color figure online)

Using formula (55) and our DDL-based model, we can also predict the response of system (63) for the forcing term of the form

\begin{matrix} ε \hat{F} (t) = ε (\begin{matrix} - 0.006 \\ 1.225 \end{matrix}) cos Ω t, \end{matrix}

without using any data from the forced system. As the forced DDL model (55) is nonlinear, it can capture non-linearizable phenomena such as coexisting of stable and unstable periodic orbits arising under the forcing. We can also make a forced response prediction from DMD simply by setting $D ℓ (γ) = 0$ in Eq. (55). As an inhomogeneous linear system of ODEs, however, this forced DMD model cannot predict coexisting stable and unstable periodic orbits.

In Fig. 6, we compare the forced predictions of the analytic linearization, DMD, and DDL to those computed from the nonlinear system directly via the continuation software COCO of Dankowicz and Schilder [14]. Since the forced and linearized systems are also nonlinear, we use the same continuation software to determine the stable and unstable branches of periodic orbits.

As expected, the analytic linearization is accurate while the forced response is inside the domain of convergence but deteriorates quickly for larger amplitudes. DMD gives good predictions for the peaks of the forced response diagrams, but cannot account for any of the nonlinear softening behavior, i.e., the overhangs in the curves that signal multiple coexisting periodic responses at the same forcing frequency. In contrast, while the DDL model of order $k = 5$ starts becoming inaccurate for peak prediction at larger amplitudes outside the domain of analytic linearization, it continues to capture accurately the overhangs arising from non-linearizable forced response away from the peaks. Notably, DDL even identifies the unstable branches (in dashed lines) of the periodic response accurately. For completeness, we also show results of approximate DDL, by assuming (56) in Appendix F.

Water sloshing experiment in a tank

In this section, we analyze experimental data generated by Bäuerlein and Avila [5] for forced and unforced fluid sloshing in a tank. Previous studies of this data set used nonlinear SSM-reduction to predict forced response [2, 3, 12]. Here we will use DMD and DDL to extract and compare linear reduced-order models from unforced trajectory data, then use them to predict and verify forced response curves obtained from forced trajectory data. Neither DMD nor DDL is expected to outperform the fully nonlinear approach of SSM reduction, so we will only compare them against each other.

The tank in the experiments is mounted on a platform that is displaced sinusoidally in time with various forcing amplitudes and frequencies (Fig. 7a). To train DMD and DDL, we use unforced sloshing data obtained by freezing the movement of the tank near a resonance and recording the ensuing decaying oscillations of the water surface with a camera under they die out. The resulting videos serve as input data to our analysis. Specifically, the horizontal position of the center of mass of the fluid is extracted from each video frame tracked and used as the single scalar observable.

Fig. 7 — a Schematic representation of the experimental setup (adopted from [12]). b Prediction of the decay of a test trajectory with order- $k = 5$ DDL. c Prediction of the forced response from DMD. d Prediction of the forced response from DDL. Light shading indicates the domain, in which training data for DMD and DDL was available

During such a resonance decay experiment, the system approaches its stable unforced equilibrium via oscillations that are dominated by a single mode. In terms of the phase space geometry, this means an approach to a stable fixed point along its 2D slow SSM $W (E)$ tangent to the slowest 2D real eigenspace E . As we only have a single observable from the videos, we use delay embedding to generate a larger observable space that can accommodate the 2D manifold $W (E)$ . As discussed by Cenedese et al. [12], we need an at least 5D observable space for this purpose by the Takens embedding theorem. In this space, $W (E)$ turns out to be nearly flat for short delays (see [2]), which allows us to use a linear approximation for its parametrization. The reduced coordinates on $W (E) \approx E$ can then be identified via a singular value decomposition of the data after one removes initial transients from the experimental data. The end of the transients can be identified as a point beyond which a frequency analysis of the data shows only one dominant frequency, the imaginary part of the eigenvalue corresponding to E.

All this analysis has been carried out using the publicly available SSMLearn package Cenedese et al. [11]. With $W (E)$ identified, we use the DDL method with order $k = 5$ to find the linearizing transformation and the linearized dynamics on $W (E)$ . In Fig 7b we show the prediction of the model on a decaying trajectory reserved for testing. The displacements are reported as percentage values, with respect to the depth of the tank. In Fig. 7c and d, we show predictions from DMD and DDL models for the forced response, compared with the experimentally observed response. Since the exact forcing function is unknown, we follow the calibration procedure outlined by Cenedese et al. [12] to find an equivalent forcing amplitude in the reduced-order model.

We present data for three forcing amplitudes. The DDL predictions are accurate up to $0.17 %$ amplitude, even capturing the softening trend. The largest-amplitude forcing resulted in response significantly outside the range of the training data; in this range, we were unable to find the converged forced response from DDL. We also show the corresponding DMD-predictions in Fig. 7c. Although the linear response can formally be evaluated for any forcing amplitude, DMD shows no trace of the softening trend, and is even inaccurate for low forcing amplitudes.

Model reduction and foliation in a nonlinear oscillator chain

As a final example, we consider the dynamics of a chain of nonlinear oscillators, which has been analyzed in the SSMLearn package [11]. Denoting the positions of the oscillators as $q_{i}$ for $i = 1, . . ., 5$ , we assume that the springs and dampers are linear, except for the first oscillator. The non-dimensionalized equations of motion can be written as

\begin{matrix} M \ddot{q} + C \dot{q} + K q + f (q, \dot{q}) = 0, \end{matrix}

where $M = I$ ; the springs have the same linear stiffness $k = 1$ which is encoded in $K$ via nearest-neighbor coupling. The damping is assumed to be proportional, i.e., we specifically set $C = 0.002 M + 0.005 K$ .

Three numerically generated training trajectories show decay to the $q = 0$ fixed point, as expected from the damped nature of the linear part of the system. In this example, we also seek to capture some of the transients, which motivates us to select the slow SSM $W (E)$ to be 4D, tangent to the spectral subspace $E = E_{1} \oplus E_{2}$ spanned by the the slowest mode ( $E_{1})$ and the second slowest mode ( $E_{2}$ ). As the mode corresponding to $E_{2}$ does disappear over time from the decaying signal, there is no resonance between the eigenvalues and hence Theorem 4 is applicable. As already noted, numerical data from a generic physical system described by Eq. (66) will be free from resonances. An exception is a $1 : 1$ resonance arising from a perfect symmetry, but this resonance is not excluded by Theorem 4 and has is amenable to DDL.

Within the 4D SSM $W (E)$ , we also demonstrate how to optimally reduce the dynamics to its 2D slowest SSM $W (E_{1})$ . As explained in Sect. 4.3.3, to find the trajectory in $W (E_{1})$ with which a given trajectory close to $W (E)$ ultimately synchronizes, we need to project along a point $q_{0}$ of the full trajectory $q (t)$ first onto $W (E)$ orthogonally to obtain a point $q_{0}^{4 D} \in W (E)$ . We then need to identify the stable fiber $F_{q_{0}^{2 D}}$ in $W (E)$ for which $q_{0}^{4 D} \in F_{q_{0}^{2 D}}$ holds. Finally, one has to project along $F_{q_{0}^{2 D}}$ to locate its base point $q_{0}^{2 D} \in W (E_{1})$ . The trajectory through $q_{0}^{2 D}$ in $W (E_{1})$ will then be the one with which the full trajectory $q (t)$ will synchronize faster than with any other trajectory. As noted in Sect. 4.3.3, computing the full nonlinear stable foliation

\begin{matrix} W (E) = ⋃_{q_{0}^{2 D} \in W (E_{1})} F_{q_{0}^{2 D}}^{0} \end{matrix}

of $W (E)$ is simple in the linearized coordinates, in which it can be achieved via a linear projection along the faster eigenspace $E_{2}$ .

We use a third-order polynomial approximation for $W (E)$ based on the three training trajectories. The polynomials depend on the reduced coordinates we introduce along E using a singular value decomposition of the trajectory data. These reduced coordinates are shown in Fig. 8, where we show a representative training trajectory, the 2D slow SSM $E_{1}$ , as well as the foliation (67) computed from DDL for this specific problem.

Fig. 8 — Reduced coordinates of the 4D SSM $W (E)$ for the oscillator chain. The slow 2D SSM $W (E_{1})$ , a typical trajectory and its projection to the slow SSM along the fibers $F_{q_{0}^{2 D}}^{0}$ are also shown. Panel a shows the linearized coordinates computed from DDL, and b shows their image under the inverse of the linearizing transformation. The order of approximation used in DDL is 3

We also evaluate the DDL-based predictions on $W (E)$ and $W (E_{1})$ by comparing them to predictions from DMD and EDMD. Performing DMD and EDMD with the data first projected to E can be interpreted as finding the linear approximation to the dynamics in $W (E)$ . Similarly, performing DMD and EDMD with the data first projected to $E_{1}$ can be interpreted as finding the linear approximation to the dynamics in $W (E_{1})$ . These are to be contrasted with performing DDL that finds the linearized reduced dynamics within $W (E)$ , which in turn contains the linearized reduced dynamics within $W (E_{1})$ . Figure 9 shows that DMD and EDMD both perform similarly to DDL on $W (E)$ . However, the 2D DMD and EDMD results obtained for $W (E_{1})$ are noticeably less accurate than the DDL results.

Fig. 9 — Predictions of a DMD b EDMD and c DDL models on a test trajectory of the oscillator chain. The order of approximation for DDL and EDMD is $k = 3$

Conclusions

We have given a new mathematical justification for the broadly used DMD procedure to eliminate the shortcomings of prior proposed justifications. Specifically, we have shown that under specific non-degeneracy conditions on the n-dimensional dynamical system, on $d \leq n$ observable functions defined for that system, and on the actual data from these observables, DMD gives a leading-order approximation to the observable dynamics on an attracting d-dimensional spectral submanifold (SSM) of the system.

This result covers both discrete and continuous dynamical systems even for $n = \infty$ . Our Theorem 1 only makes explicit non-degeneracy assumptions on the observables which will hold with probability one in practical applications. This is to be contrasted with prior approaches to DMD and its variants based on the Koopman operator, whose assumptions on the observables fail with probability one on generic observables.

Our approach also yields a systematic procedure that gradually refines the leading-order DMD approximation of the reduced observable dynamics on SSMs to higher orders. This procedure, which we call data-driven linearization (DDL), builds a nonlinear coordinate transformation under which the observable becomes linear on the attracting SSM. We have shown on several examples how DDL indeed outperforms DMD and extended DMD (EDMD), as expected. In addition to this performance increase, DDL also enables a prediction of truly nonlinear forced response from unforced data within its training range. Although we have only illustrated this for periodically forced water sloshing experiments in a tank, recent results on aperiodically time-dependent SSMs by Haller and Kaundinya [19] allow us to predict more general forced response using DDL trained on unforced observable data.

Despite all these advantages, DDL (as any linearization method) remains applicable only in parts of the phase space where the dynamics are linearizable. Yet SSMs continue to exist across basin boundaries and hence are able to carry characteristically nonlinear dynamics with multiple coexisting attractors. For such nonlinearizable dynamics, data-driven nonlinear SSM-reduction algorithms, such as SSMLearn and fastSSM, are preferable and have been showing high accuracy and predictive ability in a growing number of physical settings (see, e.g., [1, 3, 12, 13, 26, 27, 37]).

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 1978 KB)^{(1.9MB, pdf)}

Acknowledgements

We are grateful to Matthew Kvalheim and Shai Revzen for several helpful comments on an earlier version of this manuscript.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich This work was supported by the Swiss National Science Foundation.

Data availability

All data and codes used in this work are downloadable from the repository https://github.com/haller-group/DataDrivenLinearization.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Assuming that each coordinate component $x_{j}$ of the full phase space vector $x$ of system (1) falls in a span of the same set of Koopman eigenfunctions $\{ϕ_{1} (x), \dots, ϕ_{N} (x)\}$ , one defines the $j^{th}$ Koopman mode associated with $x_{j}$ as the vector $v_{j} = {(v_{1 j}, \dots, v_{Nj})}^{T}$ for which $x_{j} = \sum_{i = 1}^{N} v_{ij} ϕ_{i} (x)$ (see, e.g., Williams et al. [58]).

More precisely, Fenichel’s foliation results become applicable after the wormhole construct in Proposition B1 of Eldering et al. [17] is applied to extend $W (E)$ smoothly into a compact normally attracting invariant manifold without boundary. This is needed because Fenichel’s results only apply to compact normally attracting invariant manifolds with an empty or overflowing boundary, whereas the boundary of $W (E)$ is originally inflowing.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Alora, J.I., Cenedese, M., Schmerling, E., Haller, G., Pavone, M.: Practical deployment of spectral submanifold reduction for optimal control of high-dimensional systems. IFAC PapersOnLine 56–2, 4074–4081 (2023) [Google Scholar]
2.Axås, J., Haller, G.: Model reduction for nonlinearizable dynamics via delay-embedded spectral submanifolds. Nonlinear Dyn. (2023). 10.1007/s11071-023-08705-2 10.1007/s11071-023-08705-2 [DOI] [Google Scholar]
3.Axås, J., Cenedese, M., Haller, G.: Fast data-driven model reduction for nonlinear dynamical systems. Nonlinear Dyn. (2022). 10.1007/s11071-022-08014-0 10.1007/s11071-022-08014-0 [DOI] [Google Scholar]
4.Bates, D.M., Watts, D.G.: Nonlinear Regression Analysis and Its Applications. Wiley, Hoboken (1988) [Google Scholar]
5.Bäuerlein, B., Avila, K.: Phase lag predicts nonlinear response maxima in liquid-sloshing experiments. J. Fluid Mech. (2021). 10.1017/jfm.2021.576 10.1017/jfm.2021.576 [DOI] [Google Scholar]
6.Bettini, L., Kaszás, B., Zybach, B., Dual, J., Haller, G.: Model reduction to spectral submanifolds via oblique projection. Preprint (2024)
7.Bollt, E.M., Li, Q., Dietrich, F., Kevrekidis, I.: On matching, and even rectifying, dynamical systems through Koopman operator eigenfunctions. SIAM J. Appl. Dyn. Syst. 17(2), 1925–1960 (2018) [Google Scholar]
8.Budišić, M., Mohr, R., Mezić, I.: Applied Koopmanism. Chaos Interdiscip. J. Nonlinear Sci. 22, 047510 (2012) [DOI] [PubMed] [Google Scholar]
9.Buza, G.: Spectral submanifolds of the Navier–Stokes equations. SIAM J. Appl. Dyn. Syst. 23(2), 1052–1089 (2024) [Google Scholar]
10.Cabré, X., Fontich, E., de la Llave, R.: The parameterization method for invariant manifolds i: manifolds associated to non-resonant subspaces. Indiana Univ. Math. J. 52(2), 283–328 (2003) [Google Scholar]
11.Cenedese, M., Axås, J., Haller, G.: SSMLearn. https://github.com/haller-group/SSMLearn (2021)
12.Cenedese, M., Axås, J., Bäuerlein, B., Avila, K., Haller, G.: Data-driven modeling and prediction of non-linearizable dynamics via spectral submanifolds. Commun, Nat (2022a). 10.1038/s41467-022-28518-y [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Cenedese, M., Axås, J., Yang, H., Eriten, M., Haller, G.: Data-driven nonlinear model reduction to spectral submanifolds in mechanical systems. Philos. Trans. R. Soc. A 380, 20210194 (2022b) [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Dankowicz, H., Schilder, F.: Recipes for Continuation. SIAM, Philadelphia (2013) [Google Scholar]
15.Domb, C., Sykes, M.F.: On the susceptibility of a ferromagnetic above the curie point. Proc. R. Soc. Lond. Ser. A Math. Phys. Sci. 240(1221), 214–228 (1957) [Google Scholar]
16.Elbialy, S.M.: Local contractions of Banach spaces and spectral gap conditions. J. Func. Anal. 182, 108–150 (2001) [Google Scholar]
17.Eldering, J., Kvalheim, M., Revzen, S.: Global linearization and fiber bundle structure of invariant manifolds. Nonlinearity 31, 4202–4245 (2018) [Google Scholar]
18.Fenichel, N.: Persistence and smoothness of invariant manifolds for flows. Indiana Univ. Math. J. 21(3), 193–226 (1971) [Google Scholar]
19.Haller, G., Kaundinya, R.: Nonlinear model reduction to temporally aperiodic spectral submanifolds. Chaos 34, 043152 (2024) [DOI] [PubMed] [Google Scholar]
20.Guckenheimer, J., Holmes, P.: Nonlinear oscillations, dynamical systems and bifurcation of vector fields. Springer, New York (1983) [Google Scholar]
21.Haller, G., Kaszás, B., Liu, A., Axås, J.: Nonlinear model reduction to fractional and mixed mode spectral submanifolds. Chaos 33(6), 063138 (2023) [DOI] [PubMed] [Google Scholar]
22.Haller, G., Ponsioen, S.: Nonlinear normal modes and spectral submanifolds: existence, uniqueness and use in model reduction. Nonlinear Dyn. 86(3), 1493–1534 (2016) [Google Scholar]
23.Hartman, P.: On local homeomorphisms of Euclidean spaces. Bol. Soc. Mat. Mex. 5, 220–241 (1960) [Google Scholar]
24.Jain, S., Thurner, T., Li, M., Haller, G.: SSMTool 2.3: computation of invariant manifolds and their reduced dynamics in high-dimensional mechanics problems, pp. 1417–1450 (2023). 10.5281/zenodo.4614201
25.Jentzsch, Robert: Untersuchungen zur theorie der folgen analytischer funktionen. Acta Math. 41, 219–251 (1916) [Google Scholar]
26.Kaszás, B., Haller, G.: Capturing the edge of chaos as a spectral submanifold in pipe flows. J. Fluid. Mech. 979, A48 (2024) [Google Scholar]
27.Kaszás, B., Cenedese, M., Haller, G.: Dynamics-based machine learning of transitions in Couette flow. Phys. Rev. Fluids. 7, L082402 (2022) [Google Scholar]
28.Kaszás, B., Haller, G.: Data-driven linearization: numerical implementation. https://github.com/haller-group/DataDrivenLinearization (2024)
29.Kingma, D., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015)
30.Kogelbauer, F., Haller, G.: Rigorous model reduction for a damped-forced nonlinear beam model: an infinite-dimensional analysis. J. Nonlinear Sci. 28, 1109–1150 (2018) [Google Scholar]
31.Koopman, B.O.: Hamiltonian systems and transformation Hilbert space. Proc. Natl. Acad. Sci. USA 7, 315–318 (1931) [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Kutz, J.N., Brunton, S.L., Brunton, B.W., Proctor, J.L.: Dynamic Mode Decomposition. SIAM, Philadelphia (2016) [Google Scholar]
33.Kvalheim, M.D., Arathoon, P.: Linearizability of flows by embeddings. arXiv:2305.18288 (2023)
34.Kvalheim, M.D., Revzen, S.: Existence and uniqueness of global koopman eigenfunctions for stable fixed points and periodic orbits. Physica D 425, 132959 (2021)
35.Lan, Y., Mezic, I.: Linearization in the large of nonlinear systems and Koopman operator spectrum. Physica D 242, 42–53 (2013)
36.Li, Q., Dietrich, F., Bollt, E.M., Kevrekidis, I.G.: Extended dynamic mode decomposition with dictionary learning: a data-driven adaptive spectral decomposition of the Koopman operator. Chaos 27(10), 103111 (2017) [DOI] [PubMed] [Google Scholar]
37.Liu, A., Axås, J., Haller, G.: Data-driven modeling and forecasting of chaotic dynamics on inertial manifolds constructed as spectral submanifolds. Chaos 34, 033140 (2022) [DOI] [PubMed] [Google Scholar]
38.Liu, Z., Ozay, N., Sontag, E.D.: On the non-existence of immersions for systems with multiple omega-limit sets. IFAC-PapersOnLine 56(2), 60–64 (2023) [Google Scholar]
39.Liu, Z., Ozay, N., Sontag, E.D.: Properties of immersions for systems with multiple limit sets with implications to learning Koopman embeddings pp. 1–14. arXiv:2312.17045 (2024)
40.Mezić, I.: Analysis of fluid flows via spectral properties of the Koopman operator. Ann. Rev. Fluid Mech. 45(1), 357–378 (2013) [Google Scholar]
41.Newhouse, S.E.: On a differentiable linearization theorem of Philip Hartman. Contemp. Math. 692, 209–262 (2017) [Google Scholar]
42.Otto, S.E., Padovan, A., Rowley, C.W.: Optimizing oblique projections for nonlinear systems using trajectories. SIAM J. Sci. Comput. 44(3), A1681–A1702 (2022) [Google Scholar]
43.Otto, S.E., Padovan, A., Rowley, C.W.: Model reduction for nonlinear systems by balanced truncation of state and gradient covariance. SIAM J. Sci. Comput. 45(5), A2325–A2355 (2023) [Google Scholar]
44.Page, J., Kerswell, R.R.: Koopman mode expansions between simple invariant solutions. J. Fluid Mech. 879, 1–27 (2019) [Google Scholar]
45.Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., Lerer, A.: Automatic differentiation in pytorch. In: NIPS-W (2017)
46.Poincaré, H.: Les Méthodes Nouvelles de la Mécanique Céleste. Gauthier-Villars et Fils, Paris (1892) [Google Scholar]
47.Ponsioen, S., Pedergnana, T., Haller, G.: Automated computation of autonomous spectral submanifolds for nonlinear modal analysis. J. Sound Vib. 420, 269–295 (2018) [Google Scholar]
48.Rowley, C.W.: Model reduction for fluids, using balanced proper orthogonal decomposition. Int. J. Bifurc. Chaos 15(03), 997–1013 (2005) [Google Scholar]
49.Rowley, C.W., Mezić, I., Bagheri, S., Schlachter, P., Henningson, D.S.: Spectral analysis of nonlinear flows. J. Fluid Mech. 641, 115–127 (2009) [Google Scholar]
50.Schmid, P.J.: Dynamic mode decomposition of numerical and experimental data. J. Fluid Mech. 656, 5–28 (2010) [Google Scholar]
51.Schmid, P.J.: Dynamic mode decomposition and its variants. Ann. Rev. Fluid Mech. 54, 225–254 (2022) [Google Scholar]
52.Sternberg, S.: Local contractions and theorem of Poincaré. Am. J. Math. 79(4), 809–824 (1957) [Google Scholar]
53.Sternberg, S.: On the structure of local homeomorphisms of Euclidean -space. II. Am. J. Math. 80(3), 623–631 (1958) [Google Scholar]
54.Szaksz, B.: The stabilizing and destabilizing effects of time delays in nonlinear dynamical systems. Ph.D. Thesis, Budapest University of Technology and Economics (2024)
55.Szalai, R.: Invariant spectral foliations with applications to model order reduction and synthesis. Nonlinear Dyn. 101, 2645–2669 (2020) [Google Scholar]
56.van Strien, S.: Smooth linearization of hyperbolic fixed points without resonance conditions. J. Differ. Equ. 85(1), 66–90 (1990) [Google Scholar]
57.Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S.J., Brett, M., Wilson, J., Jarrod Millman, K., Mayorov, N., Nelson, A.R.J., Jones, E., Kern, R., Larson, E., Carey, C.J., Polat, İ, Feng, Y., Moore, E.W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E.A., Harris, C.R., Archibald, A.M., Ribeiro, A.H., Pedregosa, F., van Mulbregt, P.: SciPy 1.0 Contributors. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020) [DOI] [PMC free article] [PubMed]
58.Williams, M.O., Kevrekidis, I.G., Rowley, C.W.: A data-driven approximation of the Koopman operator: extending dynamic mode decomposition. J. Nonlinear Sci. 9, 1307–1346 (2015) [Google Scholar]
59.Williams, M.O., Rowley, C.W., Kevrekidis, I.G.: A kernel-based method for data-driven Koopman spectral analysis. J. Comput. Dyn. 2, 247–265 (2015) [Google Scholar]
60.Wu, Z., Brunton, S.L., Revzen, S.: Challenges in dynamic mode decomposition. J. R. Soc. Interface 18(185), 20210686 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary file 1 (pdf 1978 KB)^{(1.9MB, pdf)}

Data Availability Statement

All data and codes used in this work are downloadable from the repository https://github.com/haller-group/DataDrivenLinearization.

[CR1] 1.Alora, J.I., Cenedese, M., Schmerling, E., Haller, G., Pavone, M.: Practical deployment of spectral submanifold reduction for optimal control of high-dimensional systems. IFAC PapersOnLine 56–2, 4074–4081 (2023) [Google Scholar]

[CR2] 2.Axås, J., Haller, G.: Model reduction for nonlinearizable dynamics via delay-embedded spectral submanifolds. Nonlinear Dyn. (2023). 10.1007/s11071-023-08705-2 10.1007/s11071-023-08705-2 [DOI] [Google Scholar]

[CR3] 3.Axås, J., Cenedese, M., Haller, G.: Fast data-driven model reduction for nonlinear dynamical systems. Nonlinear Dyn. (2022). 10.1007/s11071-022-08014-0 10.1007/s11071-022-08014-0 [DOI] [Google Scholar]

[CR4] 4.Bates, D.M., Watts, D.G.: Nonlinear Regression Analysis and Its Applications. Wiley, Hoboken (1988) [Google Scholar]

[CR5] 5.Bäuerlein, B., Avila, K.: Phase lag predicts nonlinear response maxima in liquid-sloshing experiments. J. Fluid Mech. (2021). 10.1017/jfm.2021.576 10.1017/jfm.2021.576 [DOI] [Google Scholar]

[CR6] 6.Bettini, L., Kaszás, B., Zybach, B., Dual, J., Haller, G.: Model reduction to spectral submanifolds via oblique projection. Preprint (2024)

[CR7] 7.Bollt, E.M., Li, Q., Dietrich, F., Kevrekidis, I.: On matching, and even rectifying, dynamical systems through Koopman operator eigenfunctions. SIAM J. Appl. Dyn. Syst. 17(2), 1925–1960 (2018) [Google Scholar]

[CR8] 8.Budišić, M., Mohr, R., Mezić, I.: Applied Koopmanism. Chaos Interdiscip. J. Nonlinear Sci. 22, 047510 (2012) [DOI] [PubMed] [Google Scholar]

[CR9] 9.Buza, G.: Spectral submanifolds of the Navier–Stokes equations. SIAM J. Appl. Dyn. Syst. 23(2), 1052–1089 (2024) [Google Scholar]

[CR10] 10.Cabré, X., Fontich, E., de la Llave, R.: The parameterization method for invariant manifolds i: manifolds associated to non-resonant subspaces. Indiana Univ. Math. J. 52(2), 283–328 (2003) [Google Scholar]

[CR11] 11.Cenedese, M., Axås, J., Haller, G.: SSMLearn. https://github.com/haller-group/SSMLearn (2021)

[CR12] 12.Cenedese, M., Axås, J., Bäuerlein, B., Avila, K., Haller, G.: Data-driven modeling and prediction of non-linearizable dynamics via spectral submanifolds. Commun, Nat (2022a). 10.1038/s41467-022-28518-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Cenedese, M., Axås, J., Yang, H., Eriten, M., Haller, G.: Data-driven nonlinear model reduction to spectral submanifolds in mechanical systems. Philos. Trans. R. Soc. A 380, 20210194 (2022b) [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Dankowicz, H., Schilder, F.: Recipes for Continuation. SIAM, Philadelphia (2013) [Google Scholar]

[CR15] 15.Domb, C., Sykes, M.F.: On the susceptibility of a ferromagnetic above the curie point. Proc. R. Soc. Lond. Ser. A Math. Phys. Sci. 240(1221), 214–228 (1957) [Google Scholar]

[CR16] 16.Elbialy, S.M.: Local contractions of Banach spaces and spectral gap conditions. J. Func. Anal. 182, 108–150 (2001) [Google Scholar]

[CR17] 17.Eldering, J., Kvalheim, M., Revzen, S.: Global linearization and fiber bundle structure of invariant manifolds. Nonlinearity 31, 4202–4245 (2018) [Google Scholar]

[CR18] 18.Fenichel, N.: Persistence and smoothness of invariant manifolds for flows. Indiana Univ. Math. J. 21(3), 193–226 (1971) [Google Scholar]

[CR19] 19.Haller, G., Kaundinya, R.: Nonlinear model reduction to temporally aperiodic spectral submanifolds. Chaos 34, 043152 (2024) [DOI] [PubMed] [Google Scholar]

[CR20] 20.Guckenheimer, J., Holmes, P.: Nonlinear oscillations, dynamical systems and bifurcation of vector fields. Springer, New York (1983) [Google Scholar]

[CR21] 21.Haller, G., Kaszás, B., Liu, A., Axås, J.: Nonlinear model reduction to fractional and mixed mode spectral submanifolds. Chaos 33(6), 063138 (2023) [DOI] [PubMed] [Google Scholar]

[CR22] 22.Haller, G., Ponsioen, S.: Nonlinear normal modes and spectral submanifolds: existence, uniqueness and use in model reduction. Nonlinear Dyn. 86(3), 1493–1534 (2016) [Google Scholar]

[CR23] 23.Hartman, P.: On local homeomorphisms of Euclidean spaces. Bol. Soc. Mat. Mex. 5, 220–241 (1960) [Google Scholar]

[CR24] 24.Jain, S., Thurner, T., Li, M., Haller, G.: SSMTool 2.3: computation of invariant manifolds and their reduced dynamics in high-dimensional mechanics problems, pp. 1417–1450 (2023). 10.5281/zenodo.4614201

[CR25] 25.Jentzsch, Robert: Untersuchungen zur theorie der folgen analytischer funktionen. Acta Math. 41, 219–251 (1916) [Google Scholar]

[CR26] 26.Kaszás, B., Haller, G.: Capturing the edge of chaos as a spectral submanifold in pipe flows. J. Fluid. Mech. 979, A48 (2024) [Google Scholar]

[CR27] 27.Kaszás, B., Cenedese, M., Haller, G.: Dynamics-based machine learning of transitions in Couette flow. Phys. Rev. Fluids. 7, L082402 (2022) [Google Scholar]

[CR28] 28.Kaszás, B., Haller, G.: Data-driven linearization: numerical implementation. https://github.com/haller-group/DataDrivenLinearization (2024)

[CR29] 29.Kingma, D., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015)

[CR30] 30.Kogelbauer, F., Haller, G.: Rigorous model reduction for a damped-forced nonlinear beam model: an infinite-dimensional analysis. J. Nonlinear Sci. 28, 1109–1150 (2018) [Google Scholar]

[CR31] 31.Koopman, B.O.: Hamiltonian systems and transformation Hilbert space. Proc. Natl. Acad. Sci. USA 7, 315–318 (1931) [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Kutz, J.N., Brunton, S.L., Brunton, B.W., Proctor, J.L.: Dynamic Mode Decomposition. SIAM, Philadelphia (2016) [Google Scholar]

[CR33] 33.Kvalheim, M.D., Arathoon, P.: Linearizability of flows by embeddings. arXiv:2305.18288 (2023)

[CR34] 34.Kvalheim, M.D., Revzen, S.: Existence and uniqueness of global koopman eigenfunctions for stable fixed points and periodic orbits. Physica D 425, 132959 (2021)

[CR35] 35.Lan, Y., Mezic, I.: Linearization in the large of nonlinear systems and Koopman operator spectrum. Physica D 242, 42–53 (2013)

[CR36] 36.Li, Q., Dietrich, F., Bollt, E.M., Kevrekidis, I.G.: Extended dynamic mode decomposition with dictionary learning: a data-driven adaptive spectral decomposition of the Koopman operator. Chaos 27(10), 103111 (2017) [DOI] [PubMed] [Google Scholar]

[CR37] 37.Liu, A., Axås, J., Haller, G.: Data-driven modeling and forecasting of chaotic dynamics on inertial manifolds constructed as spectral submanifolds. Chaos 34, 033140 (2022) [DOI] [PubMed] [Google Scholar]

[CR38] 38.Liu, Z., Ozay, N., Sontag, E.D.: On the non-existence of immersions for systems with multiple omega-limit sets. IFAC-PapersOnLine 56(2), 60–64 (2023) [Google Scholar]

[CR39] 39.Liu, Z., Ozay, N., Sontag, E.D.: Properties of immersions for systems with multiple limit sets with implications to learning Koopman embeddings pp. 1–14. arXiv:2312.17045 (2024)

[CR40] 40.Mezić, I.: Analysis of fluid flows via spectral properties of the Koopman operator. Ann. Rev. Fluid Mech. 45(1), 357–378 (2013) [Google Scholar]

[CR41] 41.Newhouse, S.E.: On a differentiable linearization theorem of Philip Hartman. Contemp. Math. 692, 209–262 (2017) [Google Scholar]

[CR42] 42.Otto, S.E., Padovan, A., Rowley, C.W.: Optimizing oblique projections for nonlinear systems using trajectories. SIAM J. Sci. Comput. 44(3), A1681–A1702 (2022) [Google Scholar]

[CR43] 43.Otto, S.E., Padovan, A., Rowley, C.W.: Model reduction for nonlinear systems by balanced truncation of state and gradient covariance. SIAM J. Sci. Comput. 45(5), A2325–A2355 (2023) [Google Scholar]

[CR44] 44.Page, J., Kerswell, R.R.: Koopman mode expansions between simple invariant solutions. J. Fluid Mech. 879, 1–27 (2019) [Google Scholar]

[CR45] 45.Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., Lerer, A.: Automatic differentiation in pytorch. In: NIPS-W (2017)

[CR46] 46.Poincaré, H.: Les Méthodes Nouvelles de la Mécanique Céleste. Gauthier-Villars et Fils, Paris (1892) [Google Scholar]

[CR47] 47.Ponsioen, S., Pedergnana, T., Haller, G.: Automated computation of autonomous spectral submanifolds for nonlinear modal analysis. J. Sound Vib. 420, 269–295 (2018) [Google Scholar]

[CR48] 48.Rowley, C.W.: Model reduction for fluids, using balanced proper orthogonal decomposition. Int. J. Bifurc. Chaos 15(03), 997–1013 (2005) [Google Scholar]

[CR49] 49.Rowley, C.W., Mezić, I., Bagheri, S., Schlachter, P., Henningson, D.S.: Spectral analysis of nonlinear flows. J. Fluid Mech. 641, 115–127 (2009) [Google Scholar]

[CR50] 50.Schmid, P.J.: Dynamic mode decomposition of numerical and experimental data. J. Fluid Mech. 656, 5–28 (2010) [Google Scholar]

[CR51] 51.Schmid, P.J.: Dynamic mode decomposition and its variants. Ann. Rev. Fluid Mech. 54, 225–254 (2022) [Google Scholar]

[CR52] 52.Sternberg, S.: Local contractions and theorem of Poincaré. Am. J. Math. 79(4), 809–824 (1957) [Google Scholar]

[CR53] 53.Sternberg, S.: On the structure of local homeomorphisms of Euclidean -space. II. Am. J. Math. 80(3), 623–631 (1958) [Google Scholar]

[CR54] 54.Szaksz, B.: The stabilizing and destabilizing effects of time delays in nonlinear dynamical systems. Ph.D. Thesis, Budapest University of Technology and Economics (2024)

[CR55] 55.Szalai, R.: Invariant spectral foliations with applications to model order reduction and synthesis. Nonlinear Dyn. 101, 2645–2669 (2020) [Google Scholar]

[CR56] 56.van Strien, S.: Smooth linearization of hyperbolic fixed points without resonance conditions. J. Differ. Equ. 85(1), 66–90 (1990) [Google Scholar]

[CR57] 57.Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S.J., Brett, M., Wilson, J., Jarrod Millman, K., Mayorov, N., Nelson, A.R.J., Jones, E., Kern, R., Larson, E., Carey, C.J., Polat, İ, Feng, Y., Moore, E.W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E.A., Harris, C.R., Archibald, A.M., Ribeiro, A.H., Pedregosa, F., van Mulbregt, P.: SciPy 1.0 Contributors. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020) [DOI] [PMC free article] [PubMed]

[CR58] 58.Williams, M.O., Kevrekidis, I.G., Rowley, C.W.: A data-driven approximation of the Koopman operator: extending dynamic mode decomposition. J. Nonlinear Sci. 9, 1307–1346 (2015) [Google Scholar]

[CR59] 59.Williams, M.O., Rowley, C.W., Kevrekidis, I.G.: A kernel-based method for data-driven Koopman spectral analysis. J. Comput. Dyn. 2, 247–265 (2015) [Google Scholar]

[CR60] 60.Wu, Z., Brunton, S.L., Revzen, S.: Challenges in dynamic mode decomposition. J. R. Soc. Interface 18(185), 20210686 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Data-driven linearization of dynamical systems

George Haller

Bálint Kaszás

Abstract

Supplementary Information

Introduction

Prior justifications for DMD methods

A simple justification for the DMD algorithm

Fig. 1.

Justification of DMD for continuous dynamical systems

Theorem 1

Proof

Remark 1

Remark 2

Justification of DMD for discrete and for time-periodic continuous dynamical systems

Theorem 2

Proof

Justification of DMD for infinite-dimensional dynamical systems

Theorem 3

Proof

Data-driven linearization (DDL)

Theoretical foundation for DDL

Theorem 4

Proof

DDL versus EDMD

Implementation and applications of DDL

Basic implementation of DDL for model reduction and linearization

Algorithm 1.

Remark 3

Relationship with DMD implementations

Using DDL to construct spectral foliations

Fig. 2.

Using DDL to predict nonlinear forced response from unforced data

Examples

1D nonlinear system with two isolated fixed points

Fig. 3.

3D linear system studied via nonlinear observables

Fig. 4.

Damped and periodically forced Duffing equation

Fig. 5.

Fig. 6.

Water sloshing experiment in a tank

Fig. 7.

Model reduction and foliation in a nonlinear oscillator chain

Fig. 8.

Fig. 9.

Conclusions

Supplementary Information

Acknowledgements

Funding

Data availability

Declarations

Conflict of interest

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases