Learning Hidden States in a Chaotic System: A Physics-Informed Echo State Network Approach

Nguyen Anh Khoa Doan; Wolfgang Polifke; Luca Magri

doi:10.1007/978-3-030-50433-5_9

2020 May 25;12142:117–123.

Editors: Valeria V Krzhizhanovskaya, Gábor Závodszky, Michael H Lees, Jack J Dongarra, Peter M A Sloot, Sérgio Brissos, João Teixeira

PMCID: PMC7304749

Abstract

We extend the Physics-Informed Echo State Network (PI-ESN) framework to reconstruct the evolution of an unmeasured state (hidden state) in a chaotic system. The PI-ESN is trained by using (i) data, which contains no information on the unmeasured state, and (ii) the physical equations of a prototypical chaotic dynamical system. Non-noisy and noisy datasets are considered. First, it is shown that the PI-ESN can accurately reconstruct the unmeasured state. Second, the reconstruction is shown to be robust with respect to noisy data, which means that the PI-ESN acts as a denoiser. This paper opens up new possibilities for leveraging the synergy between physical knowledge and machine learning to enhance the reconstruction and prediction of unmeasured states in chaotic dynamical systems.

Keywords: Echo state networks, Physics-Informed Echo State Networks, Chaotic dynamical systems, State reconstruction

Introduction

In experiments on physical systems, it is often difficult to measure all the physical states, whether because the instruments have a finite resolution or because the measurement techniques have some limitations. Consequently, we are typically able to infer only a few states of the system from the measured observable quantities. The states that cannot be measured are hidden, that is, they may affect the system’s evolution, but they cannot be straightforwardly measured. The accurate reconstruction of hidden states is crucial in many fields such as cardiac blood flow modelling [13], climate science [6], and fluid dynamics [2], to name only a few. For example, in fluid dynamics, measurements of the velocity field with particle image velocimetry may be limited to the in-plane two-dimensional velocity, although the three-dimensional velocity is the quantity of interest. The reconstruction of unmeasured quantities from experimental measurements has been the subject of recent studies that used a variety of data assimilation and/or machine learning techniques. For example, spectral nudging, which combines data assimilation with physical equations, was used to infer temperature and rotation rate in 3D isotropic rotating turbulence [3]. Alternatively, [5] reconstructed the fine-scale features of an unsteady flow from large-scale information by using a series of Convolutional Neural Networks. A similar approach was used to reconstruct the velocity from hydroxyl-radical planar laser-induced fluorescence images in a turbulent flame [1]. Another approach based on echo state networks has also been used for the reconstruction of time series of unmeasured states of chaotic systems [9]. While effective in reconstructing the unmeasured states, these approaches required training data with both the measured and unmeasured states.

In this paper, we propose using physical knowledge to reconstruct hidden states in a chaotic system without the need for any data on the unmeasured states during the training. This is performed with the Physics-Informed Echo State Network (PI-ESN), which has been shown to accurately forecast chaotic systems [4]. The PI-ESN, and more generally Physics-Informed Machine Learning, relies on the physical knowledge of the system under study, in the form of its conservation equations, whose residuals are included in the loss function during the training of the machine learning framework [4, 12]. These approaches, which combine physical knowledge and machine learning, have been shown to be efficient in improving the accuracy of neural networks [4, 12]. Here the PI-ESN approach is applied to the Lorenz system, which is a prototypical chaotic system [8].

The paper is organized as follows. The problem statement and the methodology based on PI-ESN are detailed in Sect. 2. Then, results are presented and discussed in Sect. 3 and final comments are summarized in Sect. 4.

Methodology: Physics-Informed Echo State Network for Learning of Hidden States

We consider a dynamical system whose governing equations are:

$$\mathcal{F}(\mathbf{y}) \equiv \dot{\mathbf{y}} + \mathcal{N}(\mathbf{y}) = 0, \tag{1}$$

where $\mathcal{F}$ is a non-linear operator, the overdot is the time derivative and $\mathcal{N}$ is a nonlinear differential operator. Equation (1) represents a formal ordinary differential equation, which governs the dynamics of a nonlinear system. It is assumed that only a subset of the system states can be observed, which is denoted $\mathbf{x} \in \mathbb{R}^{N_x}$, while the hidden states are denoted $\mathbf{h} \in \mathbb{R}^{N_h}$. The full state vector is $\mathbf{y} \in \mathbb{R}^{N_y}$, which is the concatenation of $\mathbf{x}$ and $\mathbf{h}$, i.e., $\mathbf{y} = [\mathbf{x}; \mathbf{h}]$. The vectors’ dimensions are related by $N_y = N_x + N_h$. The objective is to train a PI-ESN to reconstruct the hidden states, $\mathbf{h}$. We assume that we have training data of the measured states $\mathbf{x}(n)$ only, where $n = 0, 1, 2, \ldots, N_t - 1$ are the discrete time instants that span from 0 to $T = (N_t - 1)\Delta t$, where $\Delta t$ is the sampling time. Thus, the specific goal for the PI-ESN is to reconstruct the hidden time series, $\mathbf{h}(n)$, for the same time instants. To solve this problem, the PI-ESN of [4], which is based on the data-only ESN of [10], needs to be extended, as explained next.

The PI-ESN is composed of three main parts (Fig. 1): (i) an artificial high dimensional dynamical system, i.e., the reservoir, whose neurons’ (or units’) states at time $n$ are represented by a vector, $\mathbf{r}(n) \in \mathbb{R}^{N_r}$, representing the reservoir neuron activations; (ii) an input matrix, $\mathbf{W}_{in} \in \mathbb{R}^{N_r \times (N_x + 1)}$; and (iii) an output matrix, $\mathbf{W}_{out} \in \mathbb{R}^{N_y \times (N_r + N_x + 1)}$. The reservoir is coupled to the input signal, $\mathbf{x} \in \mathbb{R}^{N_x}$, via $\mathbf{W}_{in}$. A bias term is added to the input to excite the reservoir with a constant signal. The output of the PI-ESN, $\hat{\mathbf{y}}$, is a linear combination of the reservoir states, inputs and an additional bias:

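$$\hat{\mathbf{y}}(n) = [\hat{\mathbf{x}}(n); \hat{\mathbf{h}}(n)] = \mathbf{W}_{out} [\mathbf{r}(n); \mathbf{x}(n-1); 1], \tag{2}$$

where $[\cdot\,; \cdot]$ indicates a vertical concatenation and $\hat{\ }$ denotes the predictions from the PI-ESN. The PI-ESN outputs both the measured states, $\hat{\mathbf{x}}$, and the hidden states, $\hat{\mathbf{h}}$ (Eq. (2)). The reservoir states evolve as: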

$$\mathbf{r}(n) = \tanh\left(\mathbf{W}_{in} [\mathbf{x}(n-1); 1] + \mathbf{W} \mathbf{r}(n-1)\right), \tag{3}$$

where $\mathbf{W} \in \mathbb{R}^{N_r \times N_r}$ is the recurrent weight matrix and the (element-wise) $\tanh$ function is the activation function for the reservoir neurons. Because we wish to predict a dynamical system, the input data for the PI-ESN corresponds to the measured system state at the previous time instant, $\mathbf{x}(n-1)$, which is only a subset of the state vector. In the ESN approach [10], the input and recurrent matrices, $\mathbf{W}_{in}$ and $\mathbf{W}$, are randomly initialized once and are not trained. Only $\mathbf{W}_{out}$ is trained. The sparse matrices $\mathbf{W}_{in}$ and $\mathbf{W}$ are constructed to satisfy the Echo State Property [10]. Following [11], $\mathbf{W}_{in}$ is generated such that each row of the matrix has only one randomly chosen nonzero element, which is independently taken from a uniform distribution in the interval $[-\sigma_{in}, \sigma_{in}]$. Matrix $\mathbf{W}$ is constructed with an average connectivity $\langle d \rangle$, and the non-zero elements are taken from a uniform distribution over the interval $[-1, 1]$. All the coefficients of $\mathbf{W}$ are then rescaled by a constant so that the largest absolute eigenvalue of $\mathbf{W}$, i.e. the spectral radius, equals a value $\Lambda$, which is typically smaller than (or equal to) 1.

To train the PI-ESN, hence $\mathbf{W}_{out}$, a combination of the data available and the physical knowledge of the system is used: the components of $\mathbf{W}_{out}$ are computed such that they minimize the sum of (i) the error between the PI-ESN prediction and the measured system states, $E_d$, and (ii) the physical residual, $E_p$, on the prediction of the ESN, $\mathcal{F}(\hat{\mathbf{y}}(n))$:

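$$E_{tot} = E_d + E_p, \quad E_d = \frac{1}{N_t} \sum_{n=0}^{N_t-1} \frac{1}{N_x} \left\| \hat{\mathbf{x}}(n) - \mathbf{x}(n) \right\|^2, \quad E_p = \frac{1}{N_t} \sum_{n=0}^{N_t-1} \frac{1}{N_y} \left\| \mathcal{F}(\hat{\mathbf{y}}(n)) \right\|^2, \tag{4}$$

where $\| \cdot \|$ is the Euclidean norm. The training of the PI-ESN for the reconstruction of hidden states is initialized as follows. Matrix $\mathbf{W}_{out}$ is split into two partitions, $\mathbf{W}_{out,x}$ and $\mathbf{W}_{out,h}$, i.e. $\mathbf{W}_{out} = [\mathbf{W}_{out,x}; \mathbf{W}_{out,h}]$, which are responsible for the prediction of the observed states, $\hat{\mathbf{x}}$, and the hidden states, $\hat{\mathbf{h}}$, respectively. $\mathbf{W}_{out,x}$ is initialized by Ridge regression of the data available for the measured states: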

$$\mathbf{W}_{out,x} = \mathbf{X} \mathbf{R}^T \left( \mathbf{R} \mathbf{R}^T + \gamma \mathbf{I} \right)^{-1}, \tag{5}$$

where $\mathbf{X}$ and $\mathbf{R}$ are respectively the horizontal concatenation of the measured states, $\mathbf{x}(n)$, and of the associated ESN states, input signals and biases, $[\mathbf{r}(n); \mathbf{x}(n-1); 1]$, at the different time instants during training; $\gamma$ is the Tikhonov regularization factor [10]; and $\mathbf{I}$ is the identity matrix. Matrix $\mathbf{W}_{out,h}$ is randomly initialized to provide an initial guess for the optimization of $\mathbf{W}_{out}$. The optimization process modifies the components of $\mathbf{W}_{out}$ to obtain the hidden states, while ensuring that the predictions on the hidden states satisfy the physical equations. The optimization is performed with a stochastic gradient method (the Adam optimizer [7]) with a learning rate of 0.0001.
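The construction of the reservoir, the state update and the Ridge-regression initialization described above can be sketched in NumPy as follows. This is a minimal illustration, not the authors' implementation: the function names, the toy two-component input signal, the zero initial reservoir state without washout, and all numerical values (reservoir size, input scaling, connectivity, spectral radius, regularization) are assumptions chosen only to make the example run.

```python
import numpy as np


def build_reservoir(n_in, n_res, sigma_in, avg_degree, spectral_radius, rng):
    """Random, untrained input and recurrent matrices (hypothetical helper)."""
    # Input matrix acts on [x; 1]: one nonzero entry per row, U(-sigma_in, sigma_in).
    w_in = np.zeros((n_res, n_in + 1))
    cols = rng.integers(0, n_in + 1, size=n_res)
    w_in[np.arange(n_res), cols] = rng.uniform(-sigma_in, sigma_in, size=n_res)
    # Recurrent matrix: about avg_degree nonzero entries per row, drawn from U(-1, 1).
    mask = rng.random((n_res, n_res)) < avg_degree / n_res
    w = rng.uniform(-1.0, 1.0, size=(n_res, n_res)) * mask
    # Rescale so that the largest absolute eigenvalue equals spectral_radius.
    w *= spectral_radius / np.max(np.abs(np.linalg.eigvals(w)))
    return w_in, w


def run_reservoir(w_in, w, x):
    """Return R whose columns are [r(n); x(n-1); 1] for n = 1, ..., N_t - 1."""
    r = np.zeros(w.shape[0])  # assumed initial state; no washout in this sketch
    columns = []
    for n in range(1, x.shape[0]):
        r = np.tanh(w_in @ np.append(x[n - 1], 1.0) + w @ r)
        columns.append(np.concatenate([r, x[n - 1], [1.0]]))
    return np.array(columns).T


def ridge_init(R, X, gamma):
    """Ridge regression: X R^T (R R^T + gamma I)^(-1), solved without an explicit inverse."""
    A = R @ R.T + gamma * np.eye(R.shape[0])
    return np.linalg.solve(A, R @ X.T).T


# Toy measured signal with N_x = 2 (illustrative only, not the Lorenz data).
rng = np.random.default_rng(0)
t = np.linspace(0.0, 20.0, 400)
x_demo = np.stack([np.sin(t), np.cos(t)], axis=1)

w_in, w = build_reservoir(n_in=2, n_res=100, sigma_in=1.0,
                          avg_degree=5, spectral_radius=0.9, rng=rng)
R = run_reservoir(w_in, w, x_demo)
w_out_x = ridge_init(R, x_demo[1:].T, gamma=1e-6)
print(w_out_x.shape)
```

Solving the linear system instead of forming the inverse is a numerical design choice of this sketch; it yields the same matrix as the Ridge formula because the regularized matrix is symmetric.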

Fig. 1. Schematic of the ESN, including the bias input.

Results and Discussions

The approach described in Sect. 2 is tested for the reconstruction of the chaotic Lorenz system, which is described by [8]:

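$$\frac{d\phi_1}{dt} = \sigma (\phi_2 - \phi_1), \quad \frac{d\phi_2}{dt} = \phi_1 (\rho - \phi_3) - \phi_2, \quad \frac{d\phi_3}{dt} = \phi_1 \phi_2 - \beta \phi_3, \tag{6}$$

where $\sigma$, $\rho$ and $\beta$ are the system parameters. The training dataset contains $N_t$ time instants separated by a timestep $\Delta t$. An explicit Euler scheme is used to obtain this dataset. We assume that only measurements of $\phi_1$ and $\phi_2$ are available for the training of the PI-ESN and the state $\phi_3$ is to be reconstructed. The parameters of the reservoir of the PI-ESN are the input scaling, $\sigma_{in}$, the spectral radius, $\Lambda$, and the average connectivity, $\langle d \rangle$. For the initialization of $\mathbf{W}_{out,x}$ via Ridge regression, the Tikhonov regularization factor $\gamma$ is also required. The values of these hyperparameters are taken from a previous study [9], which performed a grid search.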
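As an illustration of how such a dataset could be produced, the sketch below integrates the Lorenz system with an explicit Euler scheme and splits the result into measured and hidden states. The parameter values (the classic chaotic values 10, 28 and 8/3), the timestep, the number of steps and the initial condition are assumptions made for this example, and the function names are hypothetical.

```python
import numpy as np


def lorenz_rhs(phi, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Right-hand side of the Lorenz system (parameter values assumed)."""
    p1, p2, p3 = phi
    return np.array([sigma * (p2 - p1),
                     p1 * (rho - p3) - p2,
                     p1 * p2 - beta * p3])


def euler_trajectory(phi0, dt, n_steps):
    """Explicit Euler: phi(n) = phi(n-1) + dt * f(phi(n-1))."""
    traj = np.empty((n_steps, 3))
    traj[0] = phi0
    for n in range(1, n_steps):
        traj[n] = traj[n - 1] + dt * lorenz_rhs(traj[n - 1])
    return traj


# Illustrative settings only; not taken from the paper.
traj = euler_trajectory(np.array([1.0, 1.0, 1.0]), dt=0.01, n_steps=2000)
x_measured = traj[:, :2]   # phi_1 and phi_2: available for training
phi3_hidden = traj[:, 2]   # phi_3: never shown to the network
print(x_measured.shape, phi3_hidden.shape)
```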

Reconstruction of Hidden States

In Fig. 2, where the time is normalized by the largest Lyapunov exponent, $\lambda_{\max}$, the reconstructed $\phi_3$ time series is shown for the last 10% of the training data for PI-ESNs with reservoirs of 50 and 600 units. (The dominant Lyapunov exponent is the exponential divergence rate of two system trajectories, which are initially infinitesimally close to each other.) The small PI-ESN (50 units) can satisfactorily reconstruct the hidden state, $\phi_3$. The accuracy slightly deteriorates when $\phi_3$ has very large minima or maxima. However, the large PI-ESN (600 units) shows an improved accuracy. The ability of the PI-ESN to reconstruct $\phi_3$, which is not present in the training data, is a key result. The reconstruction is enabled exclusively by the knowledge of the physical equation, which is enforced as a constraint in the training of the PI-ESN. This constraint allows the PI-ESN to deduce the evolution of $\phi_3$ from $\phi_1$ and $\phi_2$. Conversely, with neither the physical equation nor training data for $\phi_3$, a data-only ESN cannot learn and reconstruct $\phi_3$ because it has no information on it.
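Why the physical equations carry information on the hidden state can be seen by evaluating the two loss terms on a candidate prediction. The sketch below is a hedged illustration: it assumes a forward-difference approximation of the time derivative in the residual (the text above does not specify the discretization), the classic Lorenz parameter values, and hypothetical function names. It evaluates the loss only and does not perform the Adam optimization.

```python
import numpy as np


def lorenz_rhs(y, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Vectorised Lorenz right-hand side; y has shape (n, 3). Values assumed."""
    p1, p2, p3 = y[:, 0], y[:, 1], y[:, 2]
    return np.stack([sigma * (p2 - p1),
                     p1 * (rho - p3) - p2,
                     p1 * p2 - beta * p3], axis=1)


def total_loss(y_hat, x_meas, dt):
    """Data error on the measured states plus physical residual on the full state."""
    n_x, n_y = x_meas.shape[1], y_hat.shape[1]
    e_d = np.mean(np.sum((y_hat[:, :n_x] - x_meas) ** 2, axis=1) / n_x)
    # Assumed discretisation: residual = (y(n+1) - y(n)) / dt - f(y(n)).
    residual = (y_hat[1:] - y_hat[:-1]) / dt - lorenz_rhs(y_hat[:-1])
    e_p = np.mean(np.sum(residual ** 2, axis=1) / n_y)
    return e_d + e_p, e_d, e_p


# Reference trajectory from explicit Euler (illustrative settings).
dt = 0.01
y_true = np.empty((1000, 3))
y_true[0] = [1.0, 1.0, 1.0]
for n in range(1, len(y_true)):
    y_true[n] = y_true[n - 1] + dt * lorenz_rhs(y_true[n - 1:n])[0]
x_meas = y_true[:, :2]

# Candidate 1: correct hidden state. Candidate 2: hidden state set to zero.
_, e_d_true, e_p_true = total_loss(y_true, x_meas, dt)
y_bad = y_true.copy()
y_bad[:, 2] = 0.0
_, e_d_bad, e_p_bad = total_loss(y_bad, x_meas, dt)
print(e_d_true, e_p_true, e_d_bad, e_p_bad)
```

Both candidates match the measured states exactly, so the data term cannot distinguish them; only the physical residual penalizes the wrong hidden state, which is the mechanism the optimization of the output matrix exploits.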

Effect of Noise

As the ultimate objective is to work with real-world experimental data, the effect of noise on the results is investigated. The training data for $\phi_1$ and $\phi_2$ are modified by adding Gaussian noise to the original signal to imitate additive measurement noise. Two Signal-to-Noise Ratios (SNRs), 20 dB and 40 dB, are considered. The $\phi_3$ time series reconstructed by the PI-ESN trained with the noisy data are presented in Fig. 3. Despite the presence of noise in the training data, the PI-ESN reconstructs the non-noisy $\phi_3$ signal well. This means that the physical constraints in Eq. (4) act as a physics-based smoother (or denoiser) of the noisy data. This can be appreciated also in the prediction of measured states. Figure 3b shows the prediction of one of the measured states: the non-noisy original data (full black line) and the prediction from the PI-ESN (dashed red line) overlap. This means that the PI-ESN provides a denoised prediction after training.

Finally, Fig. 4 shows the root mean squared error of the reconstructed hidden state $\phi_3$, $\mathrm{RMSE} = \sqrt{\frac{1}{N_t} \sum_{n=0}^{N_t-1} \left( \hat{\phi}_3(n) - \phi_3(n) \right)^2}$, for PI-ESNs of different reservoir sizes and noise levels, where $\phi_3(n)$ is the reference non-noisy data, which we wish to recover. For the non-noisy case, there is a large decrease in the RMSE when the PI-ESN has 300 units (or more). With noise, the performance in the non-noisy and low-noise (SNR = 40 dB) cases is similar, whereas for a larger noise level (SNR = 20 dB), a larger reservoir is required to keep the RMSE small, as may be expected. This suggests that the PI-ESN approach may be robust with respect to noise.
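The noise contamination and the error metric can be sketched as follows. This is an illustration under stated assumptions: the SNR is taken here as the ratio of mean signal power to noise variance expressed in decibels (the text does not give its exact definition), the test signal is a plain sine wave rather than Lorenz data, and the function names are hypothetical.

```python
import numpy as np


def add_noise(signal, snr_db, rng):
    """Add zero-mean Gaussian noise so that signal power / noise power = 10^(snr_db / 10)."""
    signal_power = np.mean(signal ** 2)
    noise_power = signal_power / 10.0 ** (snr_db / 10.0)
    return signal + rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)


def rmse(pred, ref):
    """Root mean squared error between a prediction and the reference."""
    return np.sqrt(np.mean((pred - ref) ** 2))


rng = np.random.default_rng(0)
clean = np.sin(np.linspace(0.0, 200.0, 20000))   # stand-in for a measured state
noisy_20 = add_noise(clean, 20.0, rng)
noisy_40 = add_noise(clean, 40.0, rng)

# Empirical SNR of the 20 dB case, recovered from the realised noise.
snr_20 = 10.0 * np.log10(np.mean(clean ** 2) / np.mean((noisy_20 - clean) ** 2))
print(snr_20, rmse(noisy_20, clean), rmse(noisy_40, clean))
```

With this definition, a 20 dB change in SNR corresponds to a factor of 10 in the noise amplitude, so the 20 dB data deviate from the clean signal about ten times more than the 40 dB data.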

Fig. 3. (a) Reconstruction of $\phi_3$ with PI-ESN of 600 units trained from noisy data (with zoomed inset). (b) Prediction of one of the measured states.

Fig. 4. RMSE of the reconstructed $\phi_3$ time series in the training data.

Conclusions and Future Directions

We extend the Physics-Informed Echo State Network to reconstruct the hidden states in a chaotic dynamical system. The approach combines the knowledge of the system’s physical equations and a small dataset. It is shown, on a prototypical chaotic system, that this method can (i) accurately reconstruct the hidden states; (ii) accurately reconstruct the states with training data contaminated by noise; and (iii) provide a physics-based smoothing of the noisy measured data. Compared to other reconstruction approaches, the proposed framework does not require any data of the hidden states during training. This has the potential to enable the reconstruction of unmeasured quantities in experiments of higher dimensional chaotic systems, such as fluids. This is being explored in on-going studies. Future work also aims at assessing the effect of imperfect physical knowledge on the reconstruction of the hidden states.

This paper opens up new possibilities for the reconstruction and prediction of unsteady dynamics from partial and noisy measurements.

Footnotes

The authors acknowledge the support of the Technical University of Munich - Institute for Advanced Study, funded by the German Excellence Initiative and the European Union Seventh Framework Programme under grant agreement no. 291763. L.M. also acknowledges the Royal Academy of Engineering Research Fellowship Scheme.

L. Magri—(visiting) Institute for Advanced Study, Technical University of Munich, Germany.

Contributor Information

Valeria V. Krzhizhanovskaya, Email: V.Krzhizhanovskaya@uva.nl

Gábor Závodszky, Email: G.Zavodszky@uva.nl.

Michael H. Lees, Email: m.h.lees@uva.nl

Jack J. Dongarra, Email: dongarra@icl.utk.edu

Peter M. A. Sloot, Email: p.m.a.sloot@uva.nl

Sérgio Brissos, Email: sergio.brissos@intellegibilis.com.

João Teixeira, Email: joao.teixeira@intellegibilis.com.

Nguyen Anh Khoa Doan, Email: doan@tfd.mw.tum.de.

References

1. Barwey S, Hassanaly M, Raman V, Steinberg A. Using machine learning to construct velocity fields from OH-PLIF images. Combust. Sci. Technol. 2019:1–24.
2. Brenner MP. Perspective on machine learning for advancing fluid mechanics. Phys. Rev. Fluids. 2019;4(10):100501. doi: 10.1103/PhysRevFluids.4.100501.
3. Clark Di Leoni P, Mazzino A, Biferale L. Inferring flow parameters and turbulent configuration with physics-informed data assimilation and spectral nudging. Phys. Rev. Fluids. 2018;3:104604. doi: 10.1103/PhysRevFluids.3.104604.
4. Doan NAK, Polifke W, Magri L, et al. Physics-informed echo state networks for chaotic systems forecasting. In: Rodrigues JMF, et al., editors. Computational Science – ICCS 2019; Cham: Springer; 2019. pp. 192–198.
5. Fukami K, Fukagata K, Taira K. Super-resolution reconstruction of turbulent flows with machine learning. J. Fluid Mech. 2019;870:106–120. doi: 10.1017/jfm.2019.238.
6. Kalnay E. Atmospheric Modeling, Data Assimilation, and Predictability. Cambridge: Cambridge University Press; 2003.
7. Kingma DP, Ba JL. Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings; 2015. pp. 1–15.
8. Lorenz EN. Deterministic nonperiodic flow. J. Atmos. Sci. 1963;20(2):130–141. doi: 10.1175/1520-0469(1963)020<0130:DNF>2.0.CO;2.
9. Lu Z, Pathak J, Hunt B, Girvan M, Brockett R, Ott E. Reservoir observers: model-free inference of unmeasured variables in chaotic systems. Chaos. 2017;27(4):041102. doi: 10.1063/1.4979665.
10. Lukoševičius M, Jaeger H. Reservoir computing approaches to recurrent neural network training. Comput. Sci. Rev. 2009;3(3):127–149. doi: 10.1016/j.cosrev.2009.03.005.
11. Pathak J, et al. Hybrid forecasting of chaotic processes: using machine learning in conjunction with a knowledge-based model. Chaos. 2018;28(4):041101. doi: 10.1063/1.5028373.
12. Raissi M, Perdikaris P, Karniadakis G. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019;378:686–707. doi: 10.1016/j.jcp.2018.10.045.
13. Sankaran S, Moghadam ME, Kahn AM, Tseng AM, Guccione JM. Patient-specific multiscale modeling of blood flow for coronary artery bypass graft surgery. Ann. Biomed. Eng. 2012;40(10):2228–2242. doi: 10.1007/s10439-012-0579-3.
