Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 May 25;12142:117–123. doi: 10.1007/978-3-030-50433-5_9

Learning Hidden States in a Chaotic System: A Physics-Informed Echo State Network Approach

Nguyen Anh Khoa Doan 15,16,, Wolfgang Polifke 15, Luca Magri 16,17
Editors: Valeria V Krzhizhanovskaya8, Gábor Závodszky9, Michael H Lees10, Jack J Dongarra11, Peter M A Sloot12, Sérgio Brissos13, João Teixeira14
PMCID: PMC7304749

Abstract

We extend the Physics-Informed Echo State Network (PI-ESN) framework to reconstruct the evolution of an unmeasured state (hidden state) in a chaotic system. The PI-ESN is trained by using (i) data, which contains no information on the unmeasured state, and (ii) the physical equations of a prototypical chaotic dynamical system. Non-noisy and noisy datasets are considered. First, it is shown that the PI-ESN can accurately reconstruct the unmeasured state. Second, the reconstruction is shown to be robust with respect to noisy data, which means that the PI-ESN acts as a denoiser. This paper opens up new possibilities for leveraging the synergy between physical knowledge and machine learning to enhance the reconstruction and prediction of unmeasured states in chaotic dynamical systems.

Keywords: Echo state networks, Physics-Informed Echo State Networks, Chaotic dynamical systems, State reconstruction

Introduction

In experiments on physical systems, it is often difficult to measure all the physical states, whether it be because the instruments have a finite resolution, or because the measurement techniques have some limitations. Consequently, we are typically able to infer only a few states of the system from the measured observable quantities. The states that cannot be measured are hidden, that is, they may affect the system’s evolution, but they cannot be straightforwardly measured. The accurate reconstruction of hidden states is crucial in many fields such as cardiac blood flow modelling [13], climate science [6], and fluid dynamics [2], to name only a few. For example, in fluid dynamics, measurements of the velocity field with particle image velocimetry may be limited to the in-plane two-dimensional velocity, although the three-dimensional velocity is the quantity of interest. The reconstruction of unmeasured quantities from experimental measurements has been the subject of recent studies, that used a variety of data assimilation and/or machine learning techniques. For example, spectral nudging, which combines data assimilation with physical equations, was used to infer temperature and rotation rate in 3D isotropic rotating turbulence [3]. Alternatively, [5] reconstructed the fine-scale features of an unsteady flow from large scale information by using a series of Convolutional Neural Networks. Using a similar approach, the reconstruction of the velocity from hydroxyl-radical planar laser induced fluorescence images in a turbulent flame was performed [1]. Another approach based on echo state networks has also been used for the reconstruction of time series of unmeasured states of chaotic systems [9]. While effective in reconstructing the unmeasured states, these approaches required training data with both the measured and unmeasured states. In this paper, we propose using physical knowledge to reconstruct hidden states in a chaotic system without the need of any data of the unmeasured states during the training. This is performed with the Physics-Informed Echo State Network (PI-ESN), which has been shown to accurately forecast chaotic systems [4]. The PI-ESN, and more generally Physics-Informed Machine Learning, relies on the physical knowledge of the system under study, in the form of its conservation equations, whose residuals are included in the loss function during the training of the machine learning framework [4, 12]. These approaches, which combine physical knowledge and machine learning, have been shown to be efficient in improving the accuracy of neural networks [4, 12]. Here the PI-ESN approach is applied to the Lorenz system, which is a prototypical chaotic system [8].

The paper is organized as follows. The problem statement and the methodology based on PI-ESN are detailed in Sect. 2. Then, results are presented and discussed in Sect. 3 and final comments are summarized in Sect. 4.

Methodology: Physics-Informed Echo State Network for Learning of Hidden States

We consider a dynamical system whose governing equations are:

graphic file with name M1.gif 1

where Inline graphic is a non-linear operator, Inline graphic is the time derivative and Inline graphic is a nonlinear differential operator. Equation (1) represents a formal ordinary differential equation, which governs the dynamics of a nonlinear system. It is assumed that only a subset of the system states can be observed, which is denoted Inline graphic, while the hidden states are denoted Inline graphic. The full state vector is Inline graphic, which is the concatenation of Inline graphic and Inline graphic, i.e., Inline graphic. The vectors’ dimensions are related by Inline graphic. The objective is to train a PI-ESN to reconstruct the hidden states, Inline graphic. We assume that we have training data of the measured states Inline graphic only, where Inline graphic are the discrete time instants that span from 0 to Inline graphic, where Inline graphic is the sampling time. Thus, the specific goal for the PI-ESN is to reconstruct the hidden time series, Inline graphic, for the same time instants. To solve this problem, the PI-ESN of [4], which is based on the data-only ESN of [10], needs to be extended, as explained next.

The PI-ESN is composed of three main parts (Fig. 1): (i) an artificial high dimensional dynamical system, i.e., the reservoir, whose neurons’ (or units’) states at time n are represented by a vector, Inline graphic, representing the reservoir neuron activations; (ii) an input matrix, Inline graphic, and (iii) an output matrix, Inline graphic. The reservoir is coupled to the input signal, Inline graphic, via Inline graphic. A bias term is added to the input to excite the reservoir with a constant signal. The output of the PI-ESN, Inline graphic, is a linear combination of the reservoir states, inputs and an additional bias:

graphic file with name M24.gif 2

where [; ] indicates a vertical concatenation and Inline graphic denotes the predictions from the PI-ESN. The PI-ESN outputs both the measured states, Inline graphic, and the hidden states, Inline graphic (Eq. (2)). The reservoir states evolve as:

graphic file with name M28.gif 3

where Inline graphic is the recurrent weight matrix and the (element-wise) Inline graphic function is the activation function for the reservoir neurons. Because we wish to predict a dynamical system, the input data for the PI-ESN corresponds to the measured system state at the previous time instant, Inline graphic, which is only a subset of the state vector. In the ESN approach [10], the input and recurrent matrices, Inline graphic and Inline graphic, are randomly initialized once and are not trained. Only Inline graphic is trained. The sparse matrices Inline graphic and Inline graphic are constructed to satisfy the Echo State Property [10]. Following [11], Inline graphic is generated such that each row of the matrix has only one randomly chosen nonzero element, which is independently taken from a uniform distribution in the interval Inline graphic. Matrix Inline graphic is constructed with an average connectivity Inline graphic, and the non-zero elements are taken from a uniform distribution over the interval Inline graphic. All the coefficients of Inline graphic are then multiplied by a constant coefficient for the largest absolute eigenvalue of Inline graphic, i.e. the spectral radius, to be equal to a value Inline graphic, which is typically smaller than (or equal to) 1. To train the PI-ESN, hence Inline graphic, a combination of the data available and the physical knowledge of the system is used: the components of Inline graphic are computed such that they minimize the sum of (i) the error between the PI-ESN prediction and the measured system states, Inline graphic, and (ii) the physical residual, Inline graphic, on the prediction of the ESN, Inline graphic:

graphic file with name M50.gif 4

where Inline graphic is the Euclidean norm. The training of the PI-ESN for the reconstruction of hidden states is initialized as follows. Matrix Inline graphic is split into two partitions Inline graphic and Inline graphic, i.e. Inline graphic, which are responsible for the prediction of the observed states, Inline graphic, and the hidden states, Inline graphic, respectively. Inline graphic is initialized by Ridge regression of the data available for the measured states

graphic file with name M59.gif 5

where Inline graphic and Inline graphic are respectively the horizontal concatenation of the measured states, Inline graphic, and associated ESN states, inputs signals and biases, Inline graphic at the different time instants during training; Inline graphic is the Tikhonov regularization factor [10]; and Inline graphic is the identity matrix. Matrix Inline graphic is randomly initialized to provide an initial guess for the optimization of Inline graphic. The optimization process modifies the components of Inline graphic to obtain the hidden states, while ensuring that the predictions on the hidden states satisfy the physical equations. The optimization is performed with a stochastic gradient method (the Adam-optimizer [7]) with a learning rate of 0.0001.

Fig. 1.

Fig. 1.

Schematic of the ESN. Inline graphic indicates the bias.

Results and Discussions

The approach described in Sect. 2 is tested for the reconstruction of the chaotic Lorenz system, which is described by [8]:

graphic file with name M69.gif 6

where Inline graphic, Inline graphic and Inline graphic. The size of the training dataset is Inline graphic with a timestep between two time instants of Inline graphic. An explicit Euler scheme is used to obtain this dataset. We assume that only measurements of Inline graphic and Inline graphic are available for the training of the PI-ESN and the state Inline graphic is to be reconstructed. The parameters of the reservoir of the PI-ESN are taken to be: Inline graphic, Inline graphic and Inline graphic. For the initialization of Inline graphic via Ridge regression, a value of Inline graphic is used for the Tikhonov regularization. These values of the hyperparameters are taken from previous studies [9], who performed a grid search.

Reconstruction of Hidden States

In Fig. 2 where the time is normalized by the largest Lyapunov exponent, Inline graphic, the reconstructed Inline graphic time series is shown for the last 10% of the training data for PI-ESNs with reservoirs of 50 and 600 units. (The dominant Lyapunov exponent is the exponential divergence rate of two system trajectories, which are initially infinitesimally close to each other.) The small PI-ESN (50 units) can satisfactorily reconstruct the hidden state, Inline graphic. The accuracy slightly deteriorates when Inline graphic has very large minima or maxima (e.g., Inline graphic). However, the large PI-ESN (600 units) shows an improved accuracy. The ability of the PI-ESN to reconstruct Inline graphic, which is not present in the training data, is a key-result. The reconstruction is enabled exclusively by the knowledge of the physical equation, which is constrained into the training of the PI-ESN. This constraint allows the PI-ESN to deduce the evolution of Inline graphic from Inline graphic and Inline graphic. Conversely, with neither the physical equation nor training data for Inline graphic, a data-only ESN cannot learn and reconstruct Inline graphic because it has no information on it.

Fig. 2.

Fig. 2.

Reconstruction of Inline graphic.

Effect of Noise

As the ultimate objective is to work with real-world experimental data, the effect of noise on the results is investigated. The training data for Inline graphic and Inline graphic are modified by adding Gaussian noise to the original signal to imitate additive measurements noise. Two Signal-to-Noise Ratios (SNRs) of 20 dB and 40 dB are considered. The results of the reconstructed Inline graphic time series from the PI-ESN trained with the noisy training data are presented in Fig. 3. Despite the presence of noise in the training data, the PI-ESN well reconstructs the non-noisy Inline graphic signal. This means that the physical constraints in Eq. (4) act as a physics-based smoother (or denoiser) of the noisy data. This can be appreciated also in the prediction of measured states. Figure 3b shows the prediction of state Inline graphic: the non-noisy original data (full black line) and the prediction from the PI-ESN (dashed red line) overlap. This means that the PI-ESN provides a denoised prediction after training. Finally, Fig. 4 shows the root mean squared error of the reconstructed hidden state Inline graphic, Inline graphic, for PI-ESNs of different reservoir sizes and noise levels, where Inline graphic is the reference non-noisy data, which we wish to recover. For the non-noisy case, there is a large decrease in the RMSE when the PI-ESNs has 300 units (or more). With noise, the performance between the non-noisy and low-noise (Inline graphic dB) cases are similar, whereas for a larger noise level (Inline graphic dB), a larger reservoir is required to keep the RMSE small, as it may be expected. This suggests that the PI-ESN approach may be robust with respect to noise.

Fig. 3.

Fig. 3.

(a) Reconstruction of Inline graphic with PI-ESN of 600 units trained from noisy data (with zoomed inset). (b) Prediction of Inline graphic.

Fig. 4.

Fig. 4.

RMSE of the reconstructed Inline graphic time series in the training data.

Conclusions and Future Directions

We extend the Physics-Informed Echo State Network to reconstruct the hidden states in a chaotic dynamical system. The approach combines the knowledge of the system’s physical equations and a small dataset. It is shown, on a prototypical chaotic system, that this method can (i) accurately reconstruct the hidden states; (ii) accurately reconstruct the states with training data contaminated by noise; and (iii) provide a physics-based smoothing of the noisy measured data. Compared to other reconstruction approaches, the proposed framework does not require any data of the hidden states during training. This has the potential to enable the reconstruction of unmeasured quantities in experiments of higher dimensional chaotic systems, such as fluids. This is being explored in on-going studies. Future work also aims at assessing the effect of imperfect physical knowledge on the reconstruction of the hidden states.

This paper opens up new possibilities for the reconstruction and prediction of unsteady dynamics from partial and noisy measurements.

Footnotes

The authors acknowledge the support of the Technical University of Munich - Institute for Advanced Study, funded by the German Excellence Initiative and the European Union Seventh Framework Programme under grant agreement no. 291763. L.M. also acknowledges the Royal Academy of Engineering Research Fellowship Scheme.

L. Magri—(visiting) Institute for Advanced Study, Technical University of Munich, Germany.

Contributor Information

Valeria V. Krzhizhanovskaya, Email: V.Krzhizhanovskaya@uva.nl

Gábor Závodszky, Email: G.Zavodszky@uva.nl.

Michael H. Lees, Email: m.h.lees@uva.nl

Jack J. Dongarra, Email: dongarra@icl.utk.edu

Peter M. A. Sloot, Email: p.m.a.sloot@uva.nl

Sérgio Brissos, Email: sergio.brissos@intellegibilis.com.

João Teixeira, Email: joao.teixeira@intellegibilis.com.

Nguyen Anh Khoa Doan, Email: doan@tfd.mw.tum.de.

References

  • 1.Barwey, S., Hassanaly, M., Raman, V., Steinberg, A.: Using machine learning to construct velocity fields from OH-PLIF images. Combust. Sci. Technol. 1–24 (2019)
  • 2.Brenner MP. Perspective on machine learning for advancing fluid mechanics. Phys. Rev. Fluids. 2019;4(10):100501. doi: 10.1103/PhysRevFluids.4.100501. [DOI] [Google Scholar]
  • 3.Clark Di Leoni P, Mazzino A, Biferale L. Inferring flow parameters and turbulent configuration with physics-informed data assimilation and spectral nudging. Phys. Rev. Fluids. 2018;3:104604. doi: 10.1103/PhysRevFluids.3.104604. [DOI] [Google Scholar]
  • 4.Doan NAK, Polifke W, Magri L, et al. Physics-informed echo state networks for chaotic systems forecasting. In: Rodrigues JMF, et al., editors. Computational Science – ICCS 2019; Cham: Springer; 2019. pp. 192–198. [Google Scholar]
  • 5.Fukami K, Fukagata K, Taira K. Super-resolution reconstruction of turbulent flows with machine learning. J. Fluid Mech. 2019;870:106–120. doi: 10.1017/jfm.2019.238. [DOI] [Google Scholar]
  • 6.Kalnay E. Atmospheric Modeling, Data Assimilation, and Predictability. Cambridge: Cambridge University Press; 2003. [Google Scholar]
  • 7.Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, pp. 1–15 (2015)
  • 8.Lorenz EN. Deterministic nonperiodic flow. J. Atmos. Sci. 1963;20(2):130–141. doi: 10.1175/1520-0469(1963)020<0130:DNF>2.0.CO;2. [DOI] [Google Scholar]
  • 9.Lu Z, Pathak J, Hunt B, Girvan M, Brockett R, Ott E. Reservoir observers: model-free inference of unmeasured variables in chaotic systems. Chaos. 2017;27(4):041102. doi: 10.1063/1.4979665. [DOI] [PubMed] [Google Scholar]
  • 10.Lukoševičius M, Jaeger H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 2009;3(3):127–149. doi: 10.1016/j.cosrev.2009.03.005. [DOI] [Google Scholar]
  • 11.Pathak J, et al. Hybrid forecasting of chaotic processes: using machine learning in conjunction with a knowledge-based model. Chaos. 2018;28(4):041101. doi: 10.1063/1.5028373. [DOI] [PubMed] [Google Scholar]
  • 12.Raissi M, Perdikaris P, Karniadakis G. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019;378:686–707. doi: 10.1016/j.jcp.2018.10.045. [DOI] [Google Scholar]
  • 13.Sankaran S, Moghadam ME, Kahn AM, Tseng AM, Guccione JM. Patient-specific multiscale modeling of blood flow for coronary artery bypass graft surgery. Ann. Biomed. Eng. 2012;40(10):2228–2242. doi: 10.1007/s10439-012-0579-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Computational Science – ICCS 2020 are provided here courtesy of Nature Publishing Group

RESOURCES