Observability Decomposition-Based Decentralized Kalman Filter and Its Application to Resilient State Estimation under Sensor Attacks

Chanhwa Lee

doi:10.3390/s22186909

. 2022 Sep 13;22(18):6909. doi: 10.3390/s22186909

Observability Decomposition-Based Decentralized Kalman Filter and Its Application to Resilient State Estimation under Sensor Attacks

Chanhwa Lee ¹

Editor: Fanglai Zhu¹

PMCID: PMC9502392 PMID: 36146255

Abstract

This paper considers a discrete-time linear time invariant system in the presence of Gaussian disturbances/noises and sparse sensor attacks. First, we propose an optimal decentralized multi-sensor information fusion Kalman filter based on the observability decomposition when there is no sensor attack. The proposed decentralized Kalman filter deploys a bank of local observers who utilize their own single sensor information and generate the state estimate for the observable subspace. In the absence of an attack, the state estimate achieves the minimum variance, and the computational process does not suffer from the divergent error covariance matrix. Second, the decentralized Kalman filter method is applied in the presence of sparse sensor attacks as well as Gaussian disturbances/noises. Based on the redundant observability, an attack detection scheme by the $χ^{2}$ test and a resilient state estimation algorithm by the maximum likelihood decision rule among multiple hypotheses, are presented. The secure state estimation algorithm finally produces a state estimate that is most likely to have minimum variance with an unbiased mean. Simulation results on a motor controlled multiple torsion system are provided to validate the effectiveness of the proposed algorithm.

Keywords: information fusion, decentralized Kalman filter, observability decomposition, attack resilience, secure state estimation, redundant observability, sparse sensor attack

1. Introduction

As control systems operate through network communication and become more complex due to increased connectivity, security against adversarial attacks is becoming more important and receiving attention [1,2,3,4]. In fact, attacks on control systems took place in reality [5,6,7,8], and many studies have been conducted on the security issues of systems whose measurements have been compromised by adversaries because sensors are one of the vulnerable points to malicious attackers in dynamical systems [9,10,11,12,13,14,15].

Among them, the state estimation problem when some of sensors are corrupted by attackers, often called a sparse sensor attack, has been investigated, and several solutions have been recently proposed [10,11,12,13,14,15]. The reference [10] introduces the basic concepts of the secure state estimation problem and formulates it as a non-convex combinatorial optimization problem. The problem is shown to be transformed into a convex optimization problem by using the results developed in the field of compressed sensing [16,17] under additional limiting assumptions. The relationship between this resilient state estimation problem and the notion of strong observability was revealed in [11]. A necessary and sufficient condition for the solvability of this problem is derived in [12,15] with the notion of redundant observability, more specifically, it requires the redundancy of observability twice as much as the sparsity of sensor attacks. A method to alleviate the computational complexity of the logic for finding a combination of non-attacked sensors, is proposed in [13,14]. In [15], the estimator is designed by a set of local observers with only a single sensor, and the decoder uses an error correction algorithm to generate a final state estimate based on the data collected from each local observer.

In addition to sparse sensor attacks, disturbances and noises are considered to enhance the robustness. First, bounded disturbances and noises are considered in [13,15,18], and in particular, the reference [15] explicitly derives the estimation error with the system parameters to provide an analysis of robustness. Second, zero-mean Gaussian white noises and disturbances rather than bounded ones were considered in [19,20,21], and Kalman filters were used to guarantee the state-estimation performance in a probabilistic manner. The reference [19] proposed an estimator with Kalman filters that searches a reliable subset of sensors and operates on the identified subset. A method of combining a secure state estimator and the standard Kalman filter by using the secure state estimator as a pre-filter for the Kalman filter when the set of attacked sensors changes over time, is proposed in [20]. It was shown in [21] that the optimal Kalman estimate can be decomposed into a weighted sum of local estimates, where each estimate uses only a single sensor measurement and that a secure state estimation can be achieved by a convex optimization under some additional assumptions.

This paper considers a general discrete-time linear dynamical system that is corrupted by sparse sensor attacks and Gaussian disturbances/noises. First, we construct local observers on each single sensor and design those local observers with Kalman filters using their own sensor data to cope with Gaussian disturbances/noises. The design of local observers is fully decentralized since it does not utilize any information including Kalman gains or error covariance matrices from other sensors as well as the sensor readings. Furthermore, the local observer’s error covariance is guaranteed not to diverge since it is constructed in the observable subspace based on the observability decomposition, and thus, there is no numerical computational error in practice. Second, a novel information fusion scheme is developed to counteract sparse sensor attacks while maintaining the minimum variance properties. The information fusion center detects the presence of sensor attack in the selected subset of sensors by the $χ^{2}$ test, which is typically used in the area of fault detection [22,23]. If the $χ^{2}$ test concludes that there is an attack in the selected subset, a search algorithm is launched to choose a new index set of sensors that is most likely to be unattacked by the multiple hypothesis test. Each hypothesis produces a state estimate with minimum variance, assuming that the index set is attack-free so that each estimate is unbiased. Therefore, the information fusion scheme finally produces a state estimate that is most likely to have the minimum variance and to be unbiased.

Assuming that there exist only Gaussian disturbances/noises without any attacks, a basic information fusion Kalman filter scheme was proposed in [24,25]. The local observers in [24,25] were designed using a Kalman filter for the entire state variable with a single sensor, and a fusion algorithm generates the optimal state estimate with the minimum variance. However, as highlighted in [26], some components of the error covariance may diverge if a single-sensor system is not observable, and this can induce numerical computation problems in practice. This problem can be solved by reducing the target state space to an observable subspace and designing a Kalman filter for the reduced observable subsystem. The idea of decomposing a single-sensor system into the observable subsystem and the unobservable subsystem was proposed in [15] for the secure state estimator design under bounded disturbances/noises, and in [27] for the distributed Luenberger observer design of sensor networks. Hence, adopting this idea and designing the Kalman filter for the observable subsystem, the problem of divergent error covariance does not occur, and we derive the optimal information fusion algorithm even when the size of the local information is different each other.

The contributions of this paper can be summarized as follows:

(1)
The proposed algorithm successfully estimates the state variable under sparse sensor attacks as well as Gaussian disturbances/noises. Our algorithm ensures the minimum variance, while [19] simply guarantees that its covariance is no worse than the worst case scenario with high probability;
(2)
We only assume that the system is redundant observable, which is known as an equivalent condition for the secure state estimation to be solvable under sparse sensor attacks. Note that [20] requires additional assumptions to reformulate the problem as a convex problem, and further, the combination of Kalman filter and the secure estimator implicitly supposes that the estimation error for the attack signal follows a zero-mean Gaussian distribution, which may not be true when the attack signal is intelligently designed in a coordinated way. The reference [21] needs the system matrix to be nonsingular, and both references [20] and [21] have additional assumptions about the closed-loop system;
(3)
The construction of the local observer is completely decentralized, and the overall size of the observer is relatively small. As the combinatorial logic is embedded in the fusion center, we do not have to prepare all possible combinations of observers. Note that [19] does not utilize any decomposition, and thus, it asks for all combinations of observers. The local decomposition presented in [21] is not fully decentralized because the decomposition is performed using the global information of the output matrix and the Kalman gain;
(4)
As a by-product obtained during the derivation process, the optimal decentralized information fusion Kalman filter scheme is developed based on the observability decomposition. Compared with the results in [24,25], the proposed scheme does not suffer from the numerical computational errors resulting from the diverging error covariance matrix. The algorithm in this paper guarantees that each error covariance matrix in the local observer converges by the observability decomposition, and this method can also be widely used for the multi-sensor information fusion Kalman filters that do not consider any attacks.

The rest of the paper is organized as follows. The remaining of this section introduces the notation used throughout the paper. The system model and problem formulation are given in Section 2. Section 3 presents the optimal multi-sensor information fusion Kalman filter based on the observability decomposition. We then give the attack detection algorithm by $χ^{2}$ test and the attack-resilient state estimation scheme by the multiple hypothesis test in Section 4. Finally, simulation results with a servo motor system are given in Section 5, and we provide our concluding remarks in Section 6. The preliminary results of this paper were studied in [28].

Notation: Throughout this paper, the following notations are adopted. For a set S, the number of elements in the set S is denoted by $| S |$ . For a column vector $y \in R^{p}$ and its $i$ -th element $y_{i}$ , $supp (y)$ denotes the number of nonzero elements of the vector y, that is, $supp (y) : = \{i \in [p] : y_{i} \neq 0\}$ where the symbol $[p]$ is used to represent the subset of natural numbers $\{1, 2, \dots, p\} \subset N$ . The number of nonzero elements of a vector y is defined by the $ℓ_{0}$ norm, and it is written as ${∥ y ∥}_{0} : = | supp (y) |$ . We say that the vector y is $q$ -sparse if its $ℓ_{0}$ norm is less than or equal to $q$ , that is, ${∥ y ∥}_{0} \leq q$ .

For an index set $I \subset [p]$ and a vector $y \in R^{p}$ (or a matrix $C \in R^{p \times n}$ ), $y_{I} \in R^{| I |}$ (or $C_{I} \in R^{| I | \times n}$ ) denotes the vector (or the matrix) obtained from y (or C) by eliminating all $i$ -th rows such that $i \in I^{c}$ . Similarly, for two index sets $I, J \subset [p]$ and a matrix $P \in R^{p \times p}$ , $P_{I, J} \in R^{| I | \times | J |}$ denotes the matrix obtained from P by eliminating all $i$ -th rows and all $j$ -th columns such that $i \in I^{c}$ and $j \in J^{c}$ .

Let a finite sequence $\{μ_{i}\} = \{μ_{1}, μ_{2}, \dots, μ_{p}\}$ with $μ = \sum_{i = 1}^{p} μ_{i}$ given. A stacked vector $z = {[z_{1}^{⊤} z_{2}^{⊤} \dots z_{p}^{⊤}]}^{⊤} \in R^{μ}$ is said to be partitioned by the sequence $\{μ_{i}\}$ if $z_{i} \in R^{μ_{i}}$ for all $i \in [p]$ . For $j \in [p]$ , an index set $I_{j}^{\{μ_{i}\}} : = \{(\sum_{i = 1}^{j - 1} μ_{i}) + 1, (\sum_{i = 1}^{j - 1} μ_{i}) + 2, \dots, \sum_{i = 1}^{j} μ_{i}\} \subset [μ]$ represents the $j$ -th partition among total p partitions when a vector $z \in R^{μ}$ is partitioned by the sequence $\{μ_{i}\}$ . This notation is extended to a subset $J \subset [p]$ where $I_{J}^{\{μ_{i}\}}$ denotes $⋃_{j \in J}^{} I_{j}^{\{μ_{i}\}}$ . A vector $z \in R^{μ}$ partitioned by the sequence $\{μ_{i}\}$ , is said to be ( $\{μ_{i}\}$ -stacked) $q$ -sparse if $|\{j \in [p] : z_{I_{j}^{\{μ_{i}\}}} \neq 0_{μ_{j} \times 1}\}| \leq q$ .

2. System Modeling and Problem Formulation

The plant and the attack model under consideration are presented, and the problem formulation is given in this section.

2.1. Plant Modeling with Gaussian Disturbances and Noises

A discrete-time linear time invariant (LTI) system under Gaussian disturbances and noises given by

P : \{\begin{matrix} x (k + 1) = A x (k) + B u (k) + d (k) \\ y (k) = C x (k) + n (k) \end{matrix}

(1)

is considered. In the plant dynamics of (1), $x \in R^{n}$ is the state variable vector, $u \in R^{m}$ is the control input vector, and $y \in R^{p}$ is the sensor output vector. Furthermore, the dynamics is disrupted by the process disturbance $d \in R^{n}$ , and the sensors are corrupted by the measurement noise $n \in R^{p}$ . There are a total of p sensors that measure the system outputs, and the $i$ -th sensor’s measurement at time k is denoted by

y_{i} (k) = c_{i} x (k) + n_{i} (k)

where $c_{i}$ is the $i$ -th row of the output matrix C, which implies that $C = {[c_{1}^{⊤} c_{2}^{⊤} \dots c_{p}^{⊤}]}^{⊤}$ . Here, stochastic assumptions on the disturbance $d (k)$ , the noise $n (k)$ and the initial state $x (0)$ of the system (1) are formally stated as follows.

Assumption 1.

The disturbance $d (k)$ and measurement noise $n (k)$ are independent and identically distributed (i.i.d.) white Gaussian process with zero-mean and covariance matrices Q and R, respectively. More specifically,

$\begin{matrix} d (k) & \sim N (0_{n \times 1}, & Q), \\ n (k) & \sim N (0_{p \times 1}, & R), \\ E [d (k)] & = 0_{n \times 1}, & E [d (k) d^{⊤} (t)] & = Q δ_{k t}, \\ E [n (k)] & = 0_{p \times 1}, & E [n (k) n^{⊤} (t)] & = R δ_{k t}, \\ E [n (k) d^{⊤} (t)] & = O_{p \times n}, \end{matrix}$

where the symbol $E [\cdot]$ represents the expected value of a random variable and $δ_{k t}$ is the Kronecker delta function. Furthermore, the initial state $x (0)$ is a Gaussian distributed random variable with the mean ${\bar{x}}_{0}$ and covariance matrix $P_{0}$ ,

$\begin{matrix} x (0) & \sim N ({\bar{x}}_{0}, P_{0}), \\ E [x (0)] & = {\bar{x}}_{0}, E [(x (0) - {\bar{x}}_{0}) {(x (0) - {\bar{x}}_{0})}^{⊤}] = P_{0}, \end{matrix}$

and is independent of $d (k)$ and $n (k)$ .

2.2. Attack Modeling with Sparse Sensor Attacks

Among various attack scenarios [3], we consider false data injection attacks on sensors. Adversarial attackers can inject arbitrary inputs to some (not all) sensors so that a part of the measurements is compromised. Some additive inputs may be induced by cyber or physical tampering with the sensors, or adversaries may penetrate into the communication network on the output side of the plant because those communication links are not secure. In both cases, the attack is characterized by the attack vector $a \in R^{p}$ as in

\begin{matrix} \begin{matrix} y^{a} (k) & = y (k) + a (k) \\ = C x (k) + n (k) + a (k) \\ = C x (k) + n^{a} (k) \end{matrix} \end{matrix}

(2)

where $y^{a} \in R^{p}$ denotes sensor readings with a potential attack, while $y \in R^{p}$ is the original healthy sensor data affected by the measurement noise only. Similarly, $n^{a} \in R^{p}$ represents the total sensor contamination signal including both the noise n and the attack a.

Here, it is assumed that the adversaries can compromise only a part of the sensors, not all of them. Assuming that the attacker’s resources are limited, we suppose that the attacker can contaminate up to $q$ out of p measurement outputs. Therefore, a formal condition on the sparsity of the attack vector a can be given as follows.

Assumption 2.

The sensor attack vector $a (k)$ is $q$ -sparse for all $k \geq 0$ , that is, ${∥ a (k) ∥}_{0} \leq q,^{\forall} k \geq 0$ . Moreover, it holds that

$|\{i \in [p] : a_{i} (k) \neq 0 for some k \geq 0\}| \leq q .$

This assumption tells more than ${∥ a (k) ∥}_{0} \leq q$ for all $k \geq 0$ , in the sense that the compromised sensor channels are not altered for all time. In practice, this may be the case because it takes quite a long time and much effort to infiltrate into a new sensor from a malicious attacker’s point of view. Thus, without loss of generality, it can be assumed that the attack channels remain the same in the long term although it is not revealed to the controller which channels are attacked. However, if the attacked sensor channel changes but does not change frequently, the resilient state estimation scheme to be presented is still applicable. We will simply refer to this assumption as a “ $q$ -sparse sensor attack”.

2.3. Problem Formulation

For the given discrete-time LTI system (1) under Assumptions 1 and 2, this paper investigates how to design an estimator that can recover the state variable x correctly. First, the Gaussian distributed disturbances/noises are handled appropriately, and the optimality in the sense of minimum variance should be recovered. Second, the security against the sparse sensor attack is enhanced, and the attack-resilient estimation with the unbiased state estimate should be achieved. More specifically, this paper considers the problem of proposing a secure and robust state estimation algorithm that generates the estimate that is most likely to have the minimum variance and to be unbiased. In this process, the concept of “redundant observability”, which characterizes the ability of coping with the sparse sensor attack, is utilized to ensure successful state estimation.

The basic condition for the observability of the system (1) with the attack model (2) satisfying Assumption 2, is given in the following assumption. Note that the assumption of “ $2 q$ redundant observability” is an equivalent condition for the system to be observable under $q$ -sparse sensor attacks ([15], Proposition 2,3,6). Here, the state estimation problem becomes challenging because this redundant observability does not guarantee for the entire states to be recovered with only a single sensor.

Assumption 3.

The system (1), or the pair $(A, C)$ , is $2 q$ redundant observable. In other words, each pair $(A, C_{I})$ is observable for any $I \subset [p]$ satisfying $| I | \geq p - 2 q$ .

3. Optimal Information Fusion Kalman Filter Based on Observability Decomposition

3.1. Kalman Observability Decomposition with Single Sensor

Since conventional Luenburger observers or Kalman filters typically have the form of

\hat{x} (k + 1) = (A - K C) \hat{x} (k) + B u (k) + K y^{a} (k),

the whole state estimates $\hat{x}$ are affected by the single sensor attack signal due to the observer gain K. In other words, any single non-zero component of a can alter all components of the state estimate $\hat{x}$ . Hence, we design a collection of observers where each local observer utilizes only a single sensor information so that an attack signal for one sensor channel only interferes with the corresponding local observer and leaves other local observers unaffected.

Consider a single-output system

\begin{matrix} P_{i} : \{\begin{matrix} x (k + 1) = A x (k) + B u (k) + d (k) \\ y_{i}^{a} (k) = c_{i} x (k) + n_{i}^{a} (k) . \end{matrix} \end{matrix}

(3)

where the $i$ -th component of $y^{a} (k)$ in (2), $y_{i}^{a} (k)$ , is the output and the dynamics is given by (1). Since the pair $(A, c_{i})$ is not necessarily observable, an estimator of the system (3) generally recovers only an (observable) portion of the full state x. The Kalman observability decomposition, which clearly describes the observable portion of the system, is now briefly introduced. For the single-output system (3), the observability matrix is written as

\begin{matrix} \begin{matrix} G_{i} : = [\begin{matrix} c_{i} \\ c_{i} A \\ c_{i} A^{2} \\ ⋮ \\ c_{i} A^{n - 1} \end{matrix}], \end{matrix} \end{matrix}

(4)

and we denote $μ_{i}$ as the rank of the observability matrix $G_{i}$ . The null space of $G_{i}$ , $N (G_{i})$ , is the so-called unobservable subspace, and the column range space of $G_{i}^{⊤}$ , $R (G_{i}^{⊤})$ , is often called the observable subspace.

One can define the similarity transformation as

\begin{matrix} [\begin{matrix} z_{i} \\ w_{i} \end{matrix}] = [\begin{matrix} {Z_{i}}^{⊤} \\ W_{i}^{⊤} \end{matrix}] x \end{matrix}

(5)

where $Z_{i} \in R^{n \times μ_{i}}$ is the matrix whose columns are th orthonormal basis of $R (G_{i}^{⊤})$ and $W_{i} \in R^{n \times (n - μ_{i})}$ is the matrix whose columns are the orthonormal basis of $N (G_{i})$ . Here, the size of those matrices is determined by

μ_{i} = rank (G_{i}) = \dim (R (G_{i}^{⊤})) and n - μ_{i} = nullity (G_{i}) = \dim (N (G_{i})) .

Note that the observable subspace $R (G_{i}^{⊤})$ is the span of column vectors in $Z_{i}$ and the unobservable subspace $N (G_{i})$ is the span of column vectors in $W_{i}$ . Since the matrix $[Z_{i} W_{i}]$ is orthogonal, we have

[\begin{matrix} {Z_{i}}^{⊤} \\ W_{i}^{⊤} \end{matrix}] [\begin{matrix} Z_{i} & W_{i} \end{matrix}] = [\begin{matrix} {Z_{i}}^{⊤} Z_{i} & {Z_{i}}^{⊤} W_{i} \\ W_{i}^{⊤} Z_{i} & W_{i}^{⊤} W_{i} \end{matrix}] = [\begin{matrix} I_{μ_{i} \times μ_{i}} & O_{μ_{i} \times (n - μ_{i})} \\ O_{(n - μ_{i}) \times μ_{i}} & I_{(n - μ_{i}) \times (n - μ_{i})} \end{matrix}] .

Moreover, because the unobservable subspace is A-invariant, any columns of $A W_{i}$ belong to $N (G_{i}) = R (W_{i})$ . Therefore, the Kalman observability decomposition of the system (3) is obtained by the transformation (5) as

\begin{matrix} P_{i}^{'} : \{\begin{matrix} [\begin{matrix} z_{i} (k + 1) \\ w_{i} (k + 1) \end{matrix}] = [\begin{matrix} {Z_{i}}^{⊤} A Z_{i} & O_{μ_{i} \times (n - μ_{i})} \\ W_{i}^{⊤} A Z_{i} & W_{i}^{⊤} A W_{i} \end{matrix}] [\begin{matrix} z_{i} (k) \\ w_{i} (k) \end{matrix}] + [\begin{matrix} {Z_{i}}^{⊤} B \\ W_{i}^{⊤} B \end{matrix}] u (k) + [\begin{matrix} {Z_{i}}^{⊤} \\ W_{i}^{⊤} \end{matrix}] d (k) \\ y_{i}^{a} (k) = [\begin{matrix} c_{i} Z_{i} & 0_{1 \times (n - μ_{i})} \end{matrix}] [\begin{matrix} z_{i} (k) \\ w_{i} (k) \end{matrix}] + n_{i}^{a} (k) . \end{matrix} \end{matrix}

(6)

Finally, the state $x \in R^{n}$ is decomposed into the observable sub-state $z_{i} \in R^{μ_{i}}$ and the unobservable sub-state $w_{i} \in R^{n - μ_{i}}$ . Further, the observable part of (6) can simply be written as

\begin{matrix} P_{i}^{o} : \{\begin{matrix} z_{i} (k + 1) = S_{i} z_{i} (k) + Z_{i}^{⊤} B u (k) + Z_{i}^{⊤} d (k) \\ y_{i}^{a} (k) = t_{i} z_{i} (k) + n_{i}^{a} (k) \end{matrix} \end{matrix}

(7)

where $S_{i} : = Z_{i}^{⊤} A Z_{i}$ and $t_{i} : = c_{i} Z_{i}$ .

3.2. Decentralized Multi-Sensor Kalman Filter

Even though the Kalman filter can be applied to unobservable linear systems, the error covariance matrix may not converge in that case. According to ([29], Theorem 26), the detectability of the system is a sufficient condition for the convergence of the error covariance matrix in Kalman filtering. Since detectability is a slightly weaker concept than observability, the results in this paper dealing with observability can be generalized to the concept of detectability with slight modifications. The design of local state estimators for the observable subsystem (7) in the form of Kalman filters using only single sensor information, is derived in this subsection. By its construction, the pair $(Z_{i}^{⊤} A Z_{i}, c_{i} Z_{i})$ , or simply denoted as $(S_{i}, t_{i})$ , is observable, and thus, the error covariance matrix of the Kalman filter designed for the system (7) converges to a positive semidefinite matrix ([29], Theorem 26).

Now, we design a decentralized Kalman filter with each single sensor output, which constitutes the local observer. Then, the design of an information fusion scheme, which collects all the information on state estimates and error covariance matrices from the decentralized Kalman filters, will be discussed in the next subsection. For the simplicity of the derivation, we assume that there are no attacks at this time, that is, $a (k) \equiv 0$ . Thus, $n^{a} (k)$ and $y^{a} (k)$ are interpreted as $n (k)$ and $y (k)$ , respectively, in this section.

Stochastic assumptions on the disturbance $d (k)$ and the noise $n (k)$ of the system (1) are formally stated in Assumption 1 where the covariance matrix R of the measurement noise $n (k)$ is partitioned as

R = [\begin{matrix} R_{1} & R_{12} & \dots & R_{1 p} \\ R_{21} & R_{2} & \dots & R_{2 p} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ R_{p 1} & R_{p 2} & \dots & R_{p} \end{matrix}] .

Finally, the assumption for each measurement noise $n_{i} (k)$ (which is the same as $n_{i}^{a} (k)$ in this section) of the system (3) can be written as follows:

\begin{matrix} n_{i} (k) & \sim N (0, R_{i}), \\ E [n_{i} (k)] & = 0, E [n_{i} (k) n_{i}^{⊤} (t)] & = R_{i} δ_{k t}, \\ E [n_{i} (k) n_{j}^{⊤} (t)] & = R_{i j} δ_{k t}, if i \neq j, \\ E [n_{i} (k) d^{⊤} (t)] & = 0_{1 \times n} . \end{matrix}

The local observer is designed by a Kalman filter for the observable subsystem (7). To this end, let ${\hat{z}}_{i} (k | k - 1)$ be the estimate of $z_{i} (k)$ based on observations from $y^{a} (0)$ to $y^{a} (k - 1)$ . Similarly, ${\hat{z}}_{i} (k | k)$ is the estimate of $z_{i} (k)$ after we process the measurement $y^{a} (k)$ at time k. Following the conventional notations in a Kalman filter, we use the terms $P_{i} (k | k - 1)$ and $P_{i} (k | k)$ to denote the estimation error covariance of ${\hat{z}}_{i} (k | k - 1)$ and ${\hat{z}}_{i} (k | k)$ , respectively. Thus, We have

\begin{matrix} \begin{matrix} P_{i} (k | k - 1) & = E [({\hat{z}}_{i} (k | k - 1) - z_{i} (k)) {({\hat{z}}_{i} (k | k - 1) - z_{i} (k))}^{⊤}], \\ P_{i} (k | k) & = E [({\hat{z}}_{i} (k | k) - z_{i} (k)) {({\hat{z}}_{i} (k | k) - z_{i} (k))}^{⊤}] . \end{matrix} \end{matrix}

(8)

Then, the Kalman filter has the following form of

\begin{matrix} O_{i} : & {\hat{z}}_{i} (k + 1 | k + 1) \\ = & S_{i} {\hat{z}}_{i} (k | k) + Z_{i}^{⊤} B u (k) + K_{i} (k + 1) (y_{i}^{a} (k + 1) - t_{i} (S_{i} {\hat{z}}_{i} (k | k) + Z_{i}^{⊤} B u (k))) \\ = & (I - K_{i} (k + 1) t_{i}) (S_{i} {\hat{z}}_{i} (k | k) + Z_{i}^{⊤} B u (k)) + K_{i} (k + 1) y_{i}^{a} (k + 1), \end{matrix}

(9)

where

\begin{matrix} \begin{matrix} {\hat{z}}_{i} (k + 1 | k + 1) & = {\hat{z}}_{i} (k + 1 | k) + K_{i} (k + 1) (y_{i}^{a} (k + 1) - t_{i} {\hat{z}}_{i} (k + 1 | k)) \end{matrix} \end{matrix}

(10a)

\begin{matrix} \begin{matrix} {\hat{z}}_{i} (k + 1 | k) & = S_{i} {\hat{z}}_{i} (k | k) + Z_{i}^{⊤} B u (k) \end{matrix} \end{matrix}

(10b)

\begin{matrix} \begin{matrix} K_{i} (k + 1) & = P_{i} (k + 1 | k) t_{i}^{⊤} {(t_{i} P_{i} (k + 1 | k) t_{i}^{⊤} + R_{i})}^{- 1} \end{matrix} \end{matrix}

(10c)

\begin{matrix} \begin{matrix} P_{i} (k + 1 | k) & = S_{i} P_{i} (k | k) S_{i}^{⊤} + Z_{i}^{⊤} Q Z_{i} \end{matrix} \end{matrix}

(10d)

\begin{matrix} \begin{matrix} P_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) P_{i} (k + 1 | k) \end{matrix} \end{matrix}

(10e)

with initial value of

\begin{matrix} \begin{matrix} {\hat{z}}_{i} (0 | 0) = Z_{i}^{⊤} {\bar{x}}_{0}, P_{i} (0 | 0) = Z_{i}^{⊤} P_{0} Z_{i} . \end{matrix} \end{matrix}

The above Equations (10) describe the recursive form of how the state estimate ${\hat{z}}_{i}$ , the Kalman gain $K_{i}$ , and the error covariance matrix $P_{i}$ evolve. The error covariance $P_{i}$ of the $i$ -th local observer defined in (8), is governed by Equations (10d) and (10e), which ensure that the covariance matrix $P_{i} (k | k)$ can be calculated by the following recursive form:

\begin{matrix} L_{i} : P_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) (S_{i} P_{i} (k | k) S_{i}^{⊤} + Z_{i}^{⊤} Q Z_{i}) \end{matrix}

(11)

with the initial value of

P_{i} (0 | 0) = Z_{i}^{⊤} P_{0} Z_{i} .

Similarly, the error cross covariance $P_{i j}$ of the $i$ -th and $j$ -th local observers can be defined by

\begin{matrix} \begin{matrix} P_{i j} (k | k - 1) & = E [({\hat{z}}_{i} (k | k - 1) - z_{i} (k)) {({\hat{z}}_{j} (k | k - 1) - z_{j} (k))}^{⊤}], \\ P_{i j} (k | k) & = E [({\hat{z}}_{i} (k | k) - z_{i} (k)) {({\hat{z}}_{j} (k | k) - z_{j} (k))}^{⊤}], \end{matrix} \end{matrix}

(12)

and the recursive formula for $P_{i j}$ is derived here. To this end, define the estimation error

\begin{matrix} \begin{matrix} {\tilde{z}}_{i} (k + 1 | k) & : = {\hat{z}}_{i} (k + 1 | k) - z_{i} (k + 1) \\ {\tilde{z}}_{i} (k + 1 | k + 1) & : = {\hat{z}}_{i} (k + 1 | k + 1) - z_{i} (k + 1), \end{matrix} \end{matrix}

(13)

and we have that

\begin{matrix} {\tilde{z}}_{i} (k + 1 | k) & = (S_{i} {\hat{z}}_{i} (k | k) + Z_{i}^{⊤} B u (k)) - (S_{i} z_{i} (k) + Z_{i}^{⊤} B u (k) + Z_{i}^{⊤} d (k)) \\ = S_{i} {\tilde{z}}_{i} (k | k) - Z_{i}^{⊤} d (k) \end{matrix}

(14a)

\begin{matrix} {\tilde{z}}_{i} (k + 1 | k + 1) & = ({\hat{z}}_{i} (k + 1 | k) + K_{i} (k + 1) (y_{i}^{a} (k + 1) - t_{i} {\hat{z}}_{i} (k + 1 | k))) - z_{i} (k + 1) \\ = (I - K_{i} (k + 1) t_{i}) {\tilde{z}}_{i} (k + 1 | k) + K_{i} (k + 1) n_{i}^{a} (k + 1) . \end{matrix}

(14b)

By substituting (14a) into (14b), the dynamics of the error ${\tilde{z}}_{i} (k | k)$ is obtained as

\begin{matrix} \begin{matrix} F_{i} : {\tilde{z}}_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) S_{i} {\tilde{z}}_{i} (k | k) - (I - K_{i} (k + 1) t_{i}) Z_{i}^{⊤} d (k) \\ + K_{i} (k + 1) n_{i}^{a} (k + 1) . \end{matrix} \end{matrix}

(15)

The errors ${\tilde{z}}_{i} (k | k)$ and ${\tilde{z}}_{j} (k | k)$ for $i \neq j$ may be correlated; thus, by using (15), the error cross covariance between ${\tilde{z}}_{i} (k | k)$ and ${\tilde{z}}_{j} (k | k)$ can be computed recursively. From the recursive form of (15), note that ${\tilde{z}}_{i} (k | k)$ is a linear combination of elements in

\begin{matrix} {{\tilde{z}}_{i} (0 | 0), d (0), \dots, d (k - 1), n_{i}^{a} (0), \dots, n_{i}^{a} (k)} . \end{matrix}

(16)

Therefore, by Assumption 1, we have (i) $n_{i}^{a} (k + 1)$ and $d (k)$ are orthogonal, (ii) ${\tilde{z}}_{i} (k | k)$ and $d (k)$ are orthogonal, and (iii) ${\tilde{z}}_{i} (k | k)$ and $n_{j}^{a} (k + 1)$ are orthogonal. Using these facts, one can derive the recursive form of the error cross covariance between ${\tilde{z}}_{i} (k | k)$ and ${\tilde{z}}_{j} (k | k)$ as follows:

\begin{matrix} L_{i j} : & P_{i j} (k + 1 | k + 1) = E [{\tilde{z}}_{i} (k + 1 | k + 1) {\tilde{z}}_{j}^{⊤} (k + 1 | k + 1)] \\ = & (I - K_{i} (k + 1) t_{i}) (S_{i} E [{\tilde{z}}_{i} (k | k) {\tilde{z}}_{j}^{⊤} (k | k)] S_{j}^{⊤} + Z_{i}^{⊤} Q Z_{j}) {(I - K_{j} (k + 1) t_{j})}^{⊤} \\ + K_{i} (k + 1) E [n_{i}^{a} (k + 1) {n_{j}^{a}}^{⊤} (k + 1)] K_{j}^{⊤} (k + 1) \\ = & (I - K_{i} (k + 1) t_{i}) (S_{i} P_{i j} (k | k) S_{j}^{⊤} + Z_{i}^{⊤} Q Z_{j}) {(I - K_{j} (k + 1) t_{j})}^{⊤} \\ + K_{i} (k + 1) R_{i j} K_{j}^{⊤} (k + 1), \end{matrix}

(17)

with the initial value of

P_{i j} (0 | 0) = Z_{i}^{⊤} P_{0} Z_{j} .

3.3. Optimal Information Fusion Based on Observability Decomposition

Based on the equivalence $Z_{i}^{⊤} x = z_{i}$ in (5) and the definition ${\tilde{z}}_{i} = {\hat{z}}_{i} - z_{i}$ in (13), we have

{\hat{z}}_{i} = z_{i} + {\tilde{z}}_{i} = Z_{i}^{⊤} x + {\tilde{z}}_{i} .

(18)

Stacking Equations (18) for all $i \in [p]$ leads to the following equation of

[\begin{matrix} {\hat{z}}_{1} (k | k) \\ ⋮ \\ {\hat{z}}_{p} (k | k) \end{matrix}] = [\begin{matrix} z_{1} (k) \\ ⋮ \\ z_{p} (k) \end{matrix}] + [\begin{matrix} {\tilde{z}}_{1} (k | k) \\ ⋮ \\ {\tilde{z}}_{p} (k | k) \end{matrix}] = [\begin{matrix} {Z_{1}}^{⊤} \\ ⋮ \\ {Z_{p}}^{⊤} \end{matrix}] x (k) + [\begin{matrix} {\tilde{z}}_{1} (k | k) \\ ⋮ \\ {\tilde{z}}_{p} (k | k) \end{matrix}] .

(19)

Finally, (19) is written in a compact form as

\begin{matrix} \hat{z} (k | k) = Φ x (k) + \tilde{z} (k | k) = Φ x (k) + v^{a} (k) \in R^{μ}, \end{matrix}

(20)

where the matrix

Φ : = [\begin{matrix} {Z_{1}}^{⊤} \\ ⋮ \\ {Z_{p}}^{⊤} \end{matrix}] \in R^{μ \times n}

(21)

is composed of the similarity transformation matrices $Z_{i}$ ’s and $v^{a} (k)$ is used for a simple notation of $\tilde{z} (k | k)$ . In Equation (20),

μ : = \sum_{i = 1}^{p} μ_{i}

denotes the size of the stacked vector.

It should be noted that all the information in (20) except the actual state $x (k)$ , are known or accessible to us. In Section 3.1, the matrix $Φ$ is generated from the orthonormal basis of the observable subspace $R (G_{i}^{⊤})$ where $G_{i}$ is the observability matrix given by (4). In Section 3.2, each local observer $O_{i}$ in (9) provides the state estimate ${\hat{z}}_{i}$ for the observable sub-state $z_{i}$ . Now, the stochastic properties of the last term

v^{a} (k) = \tilde{z} (k | k) = [\begin{matrix} {\tilde{z}}_{1} (k | k) \\ {\tilde{z}}_{2} (k | k) \\ ⋮ \\ {\tilde{z}}_{p} (k | k) \end{matrix}]

are analyzed. First, its mean is zero because ${\tilde{z}}_{i} (k | k)$ is a linear combination of elements in (16) by the Formula (15), and Assumption 1 ensures that every component in (16) has a zero mean. Second, the covariance matrix of $v^{a} (k)$ can be obtained since the error covariance matrix $P_{i}$ is computed by each local observer $L_{i}$ in (11), and the error cross covariance matrix $P_{i j}$ is generated by the second layer of the multi-sensor Kalman filter $L_{i j}$ in (17) with collected information from local observers (see Figure 1 for the structure of the proposed Kalman filter). In summary, we have

v^{a} (k) \sim N (0_{μ \times 1}, P (k | k)),

(22)

where

P (k | k) = [\begin{matrix} P_{1} (k | k) & P_{12} (k | k) & \dots & P_{1 p} (k | k) \\ P_{21} (k | k) & P_{2} (k | k) & \dots & P_{2 p} (k | k) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ P_{p 1} (k | k) & P_{p 2} (k | k) & \dots & P_{p} (k | k) \end{matrix}],

(23)

which can be recursively computed by (11) and (17). Finally, Equation (20) depicts a linear model with the measured data vector $\hat{z}$ , the known matrix $Φ$ , the noise vector $v^{a}$ with a zero-mean Gaussian distribution, and the unknown vector x to be estimated.

Structure of decentralized multi-sensor information fusion Kalman filter.

Based on the statistical estimation and detection theory [30,31], an elaborate derivation process to recover the optimal estimate of x in (20), is now presented. The minimum variance unbiased estimator (MVUE) for the data model (20) with $v^{a}$ satisfying $v^{a} \sim N (0_{μ \times 1}, P)$ is introduced as follows.

Theorem 1

([30], Theorem 4.2). For the measurement $\hat{z} = Φ x + v^{a} \in R^{μ}$ with $x \in R^{n}$ and $v^{a} \in R^{μ}$ such that $v^{a} \sim N (0_{μ \times 1}, P)$ for some $P > 0$ , the minimum variance unbiased estimator (MVUE) of x is

$D : {\hat{x}}_{MVUE} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} \hat{z}$ (24)

and the corresponding covariance matrix of ${\hat{x}}_{MVUE}$ is

$P_{{\hat{x}}_{MVUE}} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1},$ (25)

which achieves the minimum covariance in the sense that $P_{{\hat{x}}_{MVUE}} \leq P_{\hat{x}}$ for any type of estimator $\hat{x}$ .

Proof.

The results directly follows from the Gauss–Markov Theorem ([30], Theorem 6.1). However, we provide a direct proof for the readers convenience, and it follows the procedure in the proof of ([24], Theorem 1) or ([25], Theorem 1). We introduce a linear unbiased estimator

$\hat{x} = Ω \hat{z}$

and, from the unbiased assumption, it follows that

$E [\hat{x}] = E [Ω \hat{z}] = Ω E [Φ x + v^{a}] = Ω Φ E [x] = E [x] .$

Thus, we have

$Ω Φ = I_{n \times n} .$ (26)

Let the covariance matrix of the estimation error $\tilde{x} : = \hat{x} - x$ be $P_{x}$ . Then, the estimation error $\tilde{x}$ is obtained that

$\tilde{x} = \hat{x} - x = Ω \hat{z} - x = Ω \hat{z} - Ω Φ x = Ω (\hat{z} - Φ x) = Ω v^{a},$

and the covariance matrix $P_{x}$ can be computed as

$P_{x} = E [\tilde{x} {\tilde{x}}^{⊤}] = E [Ω v^{a} {v^{a}}^{⊤} Ω^{⊤}] = Ω E [v^{a} {v^{a}}^{⊤}] Ω^{⊤} = Ω P Ω^{⊤} .$

In order to find the minimum variance estimator, set the trace of the covariance matrix $P_{x}$ as the performance index

$J : = tr (P_{x}) = tr (Ω P Ω^{⊤}) .$

The Lagrangian [32] associated with J becomes

$L = J + 2 tr (Λ (Ω Φ - I_{n \times n}))$

where $Λ \in R^{n \times n}$ is a matrix representing the Lagrange multipliers. By solving

$\frac{\partial L}{\partial Ω} = O_{n \times μ},$

we have

$Ω P + Λ^{⊤} Φ^{⊤} = O_{n \times μ} .$ (27)

Combining (26) and (27) results in the following equation of

$[\begin{matrix} Ω & Λ^{⊤} \end{matrix}] [\begin{matrix} P & Φ \\ Φ^{⊤} & O_{n \times n} \end{matrix}] = [\begin{matrix} O_{n \times μ} & I_{n \times n} \end{matrix}] .$

Therefore, the matrix inversion lemma ([33], Section 2.3) yields the solution as

$[\begin{matrix} Ω & Λ^{⊤} \end{matrix}] = [\begin{matrix} O_{n \times μ} & I_{n \times n} \end{matrix}] {[\begin{matrix} P & Φ \\ Φ^{⊤} & O_{n \times n} \end{matrix}]}^{- 1} = [\begin{matrix} {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} & - {(Φ^{⊤} P^{- 1} Φ)}^{- 1} \end{matrix}] .$

Thus, we have $Ω = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}$ . Finally, the MVUE of x in (24), is obtained from ${\hat{x}}_{MVUE} = Ω \hat{z} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} \hat{z}$ , and the corresponding covariance matrix in (25) is computed by $P_{{\hat{x}}_{MVUE}} = Ω P Ω^{⊤} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1}$ . □

Theorem 1 explains how the optimal estimate is computed. The information fusion center $D$ calculates the MVUE by (24) and its covariance by (25). In summary, the whole structure of the decentralized multi-sensor information fusion Kalman filter is shown in Figure 1. The first layer is composed of the local observer $O_{i}$ , which generates the estimate ${\hat{z}}_{i}$ and the Kalman gains $K_{i}$ as given in (9) and (10). A part of the local observer $O_{i}$ , denoted as $L_{i}$ , provides the error covariance matrix $P_{i}$ . The second layer $L_{i j}$ collects the Kalman gain $K_{i}$ ’s from the first layer and gives the error cross covariance matrix $P_{i j}$ by (17). Finally, the third layer operates as an optimal information fusion center $D$ as described in Theorem 1 and computes the optimal estimate with the minimum covariance.

Remark 1.

Note that Gauss–Markov Theorem ([30], Theorem 6.1) gives the best linear unbiased estimator (BLUE) for the measurement $\hat{z} = Φ x + v^{a}$ where $v^{a}$ is a random variable, whose probability density function (PDF) is not restricted to a Gaussian distribution, with a zero mean and covariance P. Since the BLUE is also the MVUE for Gaussian data, the results of Theorem 1 also follow directly from the Gauss–Markov Theorem. The state estimate ${\hat{x}}_{MVUE}$ given in Theorem 1 is the optimal estimate since it achieves the minimum variance with an unbiased mean. A special case of Theorem 1 is considered in ([24], Theorem 1) and ([25], Theorem 1) for an information fusion scheme; however, the scheme in [24,25] may not be successful for a system whose local systems with a single sensor are not observable because the covariance matrix P could diverge in that case, whereas the covariance matrix P does not diverge in our scheme due to the Kalman observability decomposition.

4. Attack Resilient and Secure State Estimation by Decentralized Kalman Filter

4.1. Effect of Sparse Sensor Attack on Information Fusion Kalman FIlter

In the previous section, we assumed that all sensors were attack-free, that is, $a (k) \equiv 0$ . Hence, $n_{i}^{a} (k)$ and $y_{i}^{a} (k)$ in (3) and (7) were regarded as non-attacked noise $n_{i} (k)$ and output $y_{i} (k)$ , respectively. The effects of a sparse sensor attack satisfying Assumption 2 on the information fusion Kalman filter developed in Section 3 are investigated in this subsection.

By linearity, the Kalman filter in (10) can be divided into two parts with ${\hat{z}}_{i} = : g_{i} + e_{i}$ as in

\begin{matrix} g_{i} (k + 1 | k + 1) & : = g_{i} (k + 1 | k) + K_{i} (k + 1) (y_{i} (k + 1) - t_{i} g_{i} (k + 1 | k)), \end{matrix}

(28a)

\begin{matrix} e_{i} (k + 1 | k + 1) & : = e_{i} (k + 1 | k) + K_{i} (k + 1) (a_{i} (k + 1) - t_{i} e_{i} (k + 1 | k)), \end{matrix}

(28b)

\begin{matrix} g_{i} (k + 1 | k) & : = S_{i} g_{i} (k | k) + Z_{i}^{⊤} B u (k), \end{matrix}

(28c)

\begin{matrix} e_{i} (k + 1 | k) & : = S_{i} e_{i} (k | k) . \end{matrix}

(28d)

Note that $g_{i} (k + 1 | k + 1)$ and $e_{i} (k + 1 | k + 1)$ have the same dynamics with (10a), while the incoming signal $y_{i}^{a} (k + 1)$ is divided into two parts with $y_{i} (k + 1)$ and $a_{i} (k + 1)$ assigned to the dynamics of $g_{i} (k + 1 | k + 1)$ and $e_{i} (k + 1 | k + 1)$ , respectively. Similarly, $g_{i} (k + 1 | k)$ and $e_{i} (k + 1 | k)$ have the same dynamics with (10b), whereas the incoming signal $u (k)$ is solely assigned to the dynamics of $g_{i} (k + 1 | k)$ . By setting the initial conditions as

g_{i} (0 | 0) = {\hat{z}}_{i} (0 | 0) = Z_{i}^{⊤} {\bar{x}}_{0} and e_{i} (0 | 0) = 0_{μ_{i} \times 1},

it easily follows from (10a) and (10b) that

\begin{matrix} \begin{matrix} {\hat{z}}_{i} (k + 1 | k + 1) & = g_{i} (k + 1 | k + 1) + e_{i} (k + 1 | k + 1), \\ {\hat{z}}_{i} (k + 1 | k) & = g_{i} (k + 1 | k) + e_{i} (k + 1 | k) . \end{matrix} \end{matrix}

(29)

Finally, the local observer $O_{i}$ in (9) is divided into $O_{i}^{y}$ and $O_{i}^{a}$ , as follows:

\begin{matrix} \begin{matrix} O_{i}^{y} : g_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) (S_{i} g_{i} (k | k) + Z_{i}^{⊤} B u (k)) + K_{i} (k + 1) y_{i} (k + 1), \end{matrix} \end{matrix}

(30a)

\begin{matrix} \begin{matrix} O_{i}^{a} : e_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) S_{i} e_{i} (k | k) + K_{i} (k + 1) a_{i} (k + 1) . \end{matrix} \end{matrix}

(30b)

Now, define the attack-free estimation error

\begin{matrix} \begin{matrix} v_{i} (k + 1 | k + 1) & : = g_{i} (k + 1 | k + 1) - z_{i} (k + 1), \\ v_{i} (k + 1 | k) & : = g_{i} (k + 1 | k) - z_{i} (k + 1), \end{matrix} \end{matrix}

(31)

and we have that

\begin{matrix} v_{i} (k + 1 | k) & = (S_{i} g_{i} (k | k) + Z_{i}^{⊤} B u (k)) - (S_{i} z_{i} (k) + Z_{i}^{⊤} B u (k) + Z_{i}^{⊤} d (k)) \\ = S_{i} v_{i} (k | k) - Z_{i}^{⊤} d (k) \end{matrix}

(32a)

\begin{array}{l} (32b) & \begin{matrix} v_{i} (k + 1 | k + 1) & = (g_{i} (k + 1 | k) + K_{i} (k + 1) (y_{i} (k + 1) - t_{i} g_{i} (k + 1 | k))) - z_{i} (k + 1) \\ = (I - K_{i} (k + 1) t_{i}) v_{i} (k + 1 | k) + K_{i} (k + 1) n_{i} (k + 1) \\ = (I - K_{i} (k + 1) t_{i}) S_{i} v_{i} (k | k) - (I - K_{i} (k + 1) t_{i}) Z_{i}^{⊤} d (k) \end{matrix} \\ (32c) & \begin{matrix} + K_{i} (k + 1) n_{i} (k + 1), \end{matrix} \end{array}

which is the same as (14) and (15) with $n_{i}^{a}$ replaced by $n_{i}$ . By (29) and (31), the total state-estimation error defined in (13) satisfies

{\tilde{z}}_{i} (k + 1 | k + 1) = v_{i} (k + 1 | k + 1) + e_{i} (k + 1 | k + 1),

(33)

and, from (30b) and (32c), its dynamic equation is given as follows:

\begin{matrix} \begin{matrix} F_{i} : {\tilde{z}}_{i} (k + 1 | k + 1) & = (I - K_{i} (k + 1) t_{i}) S_{i} {\tilde{z}}_{i} (k | k) - (I - K_{i} (k + 1) t_{i}) Z_{i}^{⊤} d (k) \\ + K_{i} (k + 1) n_{i} (k + 1) + K_{i} (k + 1) a_{i} (k + 1), \end{matrix} \end{matrix}

(34)

which is a rewrite of (15) using the fact $n_{i}^{a} = n_{i} + a_{i}$ .

For notational simplicity, ${\hat{z}}_{i} (k | k)$ , $v_{i} (k | k)$ , and $e_{i} (k | k)$ are denoted by ${\hat{z}}_{i} (k)$ , $v_{i} (k)$ , and $e_{i} (k)$ , respectively. Then, Equation (19) becomes

[\begin{matrix} {\hat{z}}_{1} (k) \\ ⋮ \\ {\hat{z}}_{p} (k) \end{matrix}] = [\begin{matrix} {Z_{1}}^{⊤} \\ ⋮ \\ {Z_{p}}^{⊤} \end{matrix}] x (k) + [\begin{matrix} v_{1} (k) \\ ⋮ \\ v_{p} (k) \end{matrix}] + [\begin{matrix} e_{1} (k) \\ ⋮ \\ e_{p} (k) \end{matrix}],

(35)

which can be written in a compact form as

\begin{matrix} \hat{z} (k) = Φ x (k) + v (k) + e (k) \in R^{μ} . \end{matrix}

(36)

The above Equation (36) is nothing but (20) with $v^{a}$ replaced by $v + e$ . The properties of v are exactly identical with those of $v^{a}$ in (22) because the derivation in (22) is under the assumption of $a \equiv 0$ meaning $e \equiv 0$ in this case. Thus, we have

v (k) \sim N (0_{μ \times 1}, P (k)),

(37)

where $P (k)$ simply denotes $P (k | k)$ in (23). The attack-induced signal $e (k) = {[e_{1}^{⊤} (k), \dots, e_{p}^{⊤} (k)]}^{⊤}$ evolves according to Equation (30b) (or equivalently (28b) and (28d)) with an initial value of $e_{i} (0) = e_{i} (0 | 0) = 0_{μ_{i} \times 1}$ . Therefore, we have $e_{i} \equiv 0_{μ_{i} \times 1}$ for the healthy sensor with $a_{i} \equiv 0$ , while $e_{i} ≢ 0_{μ_{i} \times 1}$ generally holds for the attacked sensor with $a_{i} ≢ 0$ . Finally, the stacked error vector $e \in R^{μ}$ partitioned by the sequence $\{μ_{i}\}$ , is ( $\{μ_{i}\}$ -stacked) $q$ -sparse by Assumption 2.

4.2. Detection of Sparse Sensor Attack

In the previous subsection, the measurement data have the form $\hat{z} = Φ x + v + e \in R^{μ}$ with unknown signals x, v, and e where the noise-induced signal v can be considered as a random variable whose distribution is $N (0_{μ \times 1}, P)$ and the attack-induced signal e is ( $\{μ_{i}\}$ -stacked) $q$ -sparse. To investigate the properties of the matrix $Φ$ in the measurement data, we borrow the definition of ( $\{μ_{i}\}$ -stacked) $q$ -error detectability and its characterization from [15]. There is a slight modification in the following Definition 1 and Lemma 1 from [15]. They do not append any additional zeros, whereas [15] adds additional zeros to match the size of all partitioned vectors and matrices.

Definition 1

([15], Definition 1). For a finite sequence $\{μ_{i}\} = \{μ_{1}, μ_{2}, \dots, μ_{p}\}$ with $μ = \sum_{i = 1}^{p} μ_{i}$ , a coding matrix $Φ \in R^{μ \times n}$ is said to be ( $\{μ_{i}\}$ -stacked) $q$ -error detectable if, for all $x, x^{'} \in R^{n}$ and ( $\{μ_{i}\}$ -stacked) $q$ -sparse $e \in R^{μ}$ such that $Φ x + e = Φ x^{'}$ , it holds that $x = x^{'}$ .

Accordingly, the matrix $Φ \in R^{μ \times n}$ is not ( $\{μ_{i}\}$ -stacked) $q$ -error detectable if and only if there exist $x, x^{'} \in R^{n}$ satisfying $x \neq x^{'}$ , and ( $\{μ_{i}\}$ -stacked) $q$ -sparse $e \in R^{μ}$ such that $Φ x + e = Φ x^{'}$ . In other words, the matrix $Φ \in R^{μ \times n}$ is ( $\{μ_{i}\}$ -stacked) $q$ -error undetectable if and only if there exist a non-zero $x_{e} \in R^{n}$ and ( $\{μ_{i}\}$ -stacked) $q$ -sparse $e \in R^{μ}$ such that $Φ x_{e} = e$ . Typically, in terms of vectors, the vector $e \in R^{μ}$ is said to be undetectable with respect to $Φ \in R^{μ \times n}$ if $e = Φ x_{e} \in R^{μ}$ for some $x_{e} \in R^{n}$ .

Lemma 1

([15], Proposition 1). For a finite sequence $\{μ_{i}\} = \{μ_{1}, μ_{2}, \dots, μ_{p}\}$ with $μ = \sum_{i = 1}^{p} μ_{i}$ and a matrix $Φ \in R^{μ \times n}$ , the followings are equivalent:

(i)
The matrix $Φ \in R^{μ \times n}$ is ( $\{μ_{i}\}$ -stacked) $q$ -error detectable.

(ii)
For every set $J \subset [p]$ satisfying $| J | \geq p - q$ , $Φ_{I_{J}^{\{μ_{i}\}}}$ has full column rank.

(iii)
For any $x \in R^{n}$ where $x \neq 0_{n \times 1}$ , the vector $Φ x \in R^{μ}$ is not ( $\{μ_{i}\}$ -stacked) $q$ -sparse.

With the estimate $\hat{x}$ of x obtained by MVUE of (24) in Theorem 1, we can calculate the estimated output $Φ \hat{x}$ and generate a residual signal r, which is a difference between the real measurement and the estimated output, that is, $r : = \hat{z} - Φ \hat{x}$ . Then, the residual r becomes another random variable whose distribution is also Gaussian. Finally, the mean and covariance of the Gaussian distributed random variable r is computed in the following theorem.

Theorem 2.

For the measurement $\hat{z} = Φ x + v + e \in R^{μ}$ where $Φ \in R^{μ \times n}$ has full column rank and v satisfies $v \sim N (0_{μ \times 1}, P)$ with $P > 0$ , let $\hat{x} = Ψ \hat{z} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} \hat{z}$ and

$\begin{matrix} \begin{matrix} r : = \hat{z} - Φ \hat{x} = (I_{μ \times μ} - Φ Ψ) \hat{z} = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) \hat{z}, \end{matrix} \end{matrix}$ (38)

where $Ψ : = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}$ . Then, the residual r is Gaussian distributed with mean $(I_{μ \times μ} - Φ Ψ) e$ and covariance $(I_{μ \times μ} - Φ Ψ) P$ ,

$r \sim N ((I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) e, P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤}) .$ (39)

Furthermore, $e = Φ x_{e} \in R^{μ}$ for some $x_{e} \in R^{n}$ if and only if the mean of r, $E [r] (= (I_{μ \times μ} - Φ Ψ) e)$ , satisfies $E [r] = 0_{μ \times 1}$ . In other words, e is undetectable with respect to Φ if and only if $E [r] = 0_{μ \times 1}$ .

Proof.

First, the mean of r is computed as follows.

$\begin{matrix} \begin{matrix} E [r] & = E [(I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) \hat{z}] \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) E [Φ x + v + e] \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) (Φ x + e) \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) e = (I_{μ \times μ} - Φ Ψ) e \end{matrix} \end{matrix}$ (40)

Second, because it easily follows that

$\begin{matrix} r - E [r] & = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) (\hat{z} - e) \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) (Φ x + v) \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) v = (I_{μ \times μ} - Φ Ψ) v, \end{matrix}$

the covariance matrix is calculated as

$\begin{matrix} E [(r - E [r]) {(r - E [r])}^{⊤}] = E [(I_{μ \times μ} - Φ Ψ) v v^{⊤} {(I_{μ \times μ} - Φ Ψ)}^{⊤}] \\ = (I_{μ \times μ} - Φ Ψ) E [v v^{⊤}] {(I_{μ \times μ} - Φ Ψ)}^{⊤} = (I_{μ \times μ} - Φ Ψ) P {(I_{μ \times μ} - Φ Ψ)}^{⊤} \\ = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) P {(I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1})}^{⊤} \\ = P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} = (I_{μ \times μ} - Φ Ψ) P . \end{matrix}$

Moreover, note that

$\begin{matrix} E [r] & = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) E [\hat{z}] \end{matrix}$

because of (40), and

$\begin{matrix} E [\hat{z}] & = E [Φ x + v + e] = Φ x + e . \end{matrix}$

Since $Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}$ is a projection matrix and it projects $E [\hat{z}]$ onto the range space of $Φ$ , $R (Φ)$ , we have $E [\hat{z}] = Φ x + e \notin R (Φ)$ if and only if $E [\hat{z}] \neq Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} E [\hat{z}]$ . This implies that $e \notin R (Φ)$ if and only if $(I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) E [\hat{z}] \neq 0_{μ \times 1}$ . This completes the proof. □

Theorem 2 clarifies the mean and covariance of the Gaussian random variable r, and further, characterization of undetectable attacks with statistical analysis is also given. Now, one can derive a detection criterion of ( $\{μ_{i}\}$ -stacked) $q$ -sparse errors based on the property of the residual signal r, assuming that $Φ \in R^{μ \times n}$ is ( $\{μ_{i}\}$ -stacked) $q$ -error detectable and that $e \in R^{μ}$ is actually ( $\{μ_{i}\}$ -stacked) $q$ -sparse. This detection strategy is summarized in the following theorem.

Theorem 3.

For a finite sequence $\{μ_{i}\} = \{μ_{1}, μ_{2}, \dots, μ_{p}\}$ with $μ = \sum_{i = 1}^{p} μ_{i}$ and the measurement $\hat{z} = Φ x + v + e \in R^{μ}$ where $Φ \in R^{μ \times n}$ is ( $\{μ_{i}\}$ -stacked) $q$ -error detectable, $e \in R^{μ}$ is ( $\{μ_{i}\}$ -stacked) $q$ -sparse, and $v \in R^{μ}$ satisfies $v \sim N (0_{μ \times 1}, P)$ with $P > 0$ , let

$r = \hat{z} - Φ \hat{x} = \hat{z} - Φ Ψ \hat{z} = (I_{μ \times μ} - Φ Ψ) \hat{z} = (I_{μ \times μ} - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}) \hat{z}$

be given. Then, $e = 0_{μ \times 1}$ if and only if $E [r] = 0_{μ \times 1}$ . Moreover, when $e = 0_{μ \times 1}$ , the vector x is exactly recovered by the expectation value of $\hat{x} = Ψ \hat{z} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} \hat{z}$ , that is, $x = E [\hat{x}]$ , which means that $\hat{x}$ is an unbiased estimate of x.

Proof.

From Theorem 2, the ( $\{μ_{i}\}$ -stacked) $q$ -sparse e satisfies $e = Φ x_{e} \in R^{μ}$ for some $x_{e} \in R^{n}$ if and only if $E [r] = 0_{μ \times 1}$ . However, any non-zero $e = Φ x_{e} \in R^{μ}$ for some $x_{e} \in R^{n}$ is not ( $\{μ_{i}\}$ -stacked) $q$ -sparse by Lemma 1. (iii) since $Φ \in R^{μ \times n}$ is ( $\{μ_{i}\}$ -stacked) $q$ -error detectable. Therefore, the ( $\{μ_{i}\}$ -stacked) $q$ -sparse $e = Φ x_{e} \in R^{μ}$ should be zero, and the result directly follows. Furthermore, the property of an unbiased estimate (with minimum variance) is easily obtained from Theorem 1. □

From the observation of Theorems 2 and 3, the problem of detecting a non-zero ( $\{μ_{i}\}$ -stacked) $q$ -sparse error signal e with a ( $\{μ_{i}\}$ -stacked) $q$ -error detectable coding matrix $Φ \in R^{μ \times n}$ can be rephrased as: Given the residual signal r, which comes from the Gaussian distribution $N (E [r], P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤})$ , determine if $E [r] = 0_{μ \times 1}$ or $E [r] \neq 0_{μ \times 1}$ . Therefore, the statistical decision theory [31] is helpful in this situation. More precisely, the $χ^{2}$ test for fault detection [22,23], which is widely used to detect unwanted error signals, such as faults or attacks, can be applied.

One can simply apply the $χ^{2}$ test to detect the presence of error signals in the ( $\{μ_{i}\}$ -stacked) measurement $\hat{z}$ given by (36), and its operating scheme is summarized in Algorithm 1. Initially, the attack detection alarm indicator f is set to 0, and then the residual r is computed according to Equation (38). Without any error signal (that is, $e = 0_{μ \times 1}$ ), the residual r follows a Gaussian distribution $N (0, P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤})$ , which is shown in (39). Now, define the standardized residual $ζ : = {(P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤}))}^{- \frac{1}{2}} r$ whose distribution becomes $N (0_{μ \times 1}, I_{μ \times μ})$ . Thus, the 2-norm of $ζ$ denoted by $g : = ζ^{⊤} ζ$ is an observation from a random variable $g$ , which satisfies a $χ^{2}$ distribution with $μ$ degrees of freedom (DOF),

g \sim χ_{μ}^{2} .

This means that g cannot be far away from zero. Finally, when g is greater than a threshold $Δ_{T H}$ , the attack detection alarm is triggered by setting $f = 1$ . Here, $Δ_{T H}$ is the predetermined threshold value, and it decides the probability of false alarm and the probability of error detection. For example, when the threshold $Δ_{T H}$ is chosen such that

\int_{0}^{Δ_{T H}} p_{g} (x) d x = 1 - δ,

(41)

where $p_{g} (x)$ denotes the PDF of the $χ_{μ}^{2}$ distribution, the probability of false alarm becomes $δ$ . As the probability of false alarm $δ$ becomes smaller, the probability of error detection also decreases, which implies that there is a trade-off between the small false alarm and the high error detection ratio. Thus, one needs to choose $Δ_{T H}$ as a good compromise between these two conflicting requirements.

Algorithm 1 Detection scheme based on the

χ^{2}

test

Input:

\hat{z}

Output: f
Initialization:

f = 0

{\hat{x}}_{MVUE} = {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1} \hat{z}

r = \hat{z} - Φ {\hat{x}}_{MVUE}

ζ = {(P - Φ {(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤}))}^{- \frac{1}{2}} r

g = ζ^{⊤} ζ

5: if

g \leq Δ_{T H}

then
6:

f = 0

7: else if

g > Δ_{T H}

then
8:

f = 1

9: end if

Open in a new tab

4.3. Secure State Estimation under a Sparse Sensor Attack

In this subsection, an attack-resilient and secure state estimation scheme, which reconstructs the optimal estimate for the state x under Assumptions 1–3, is developed. First, characterization of the matrix $Φ$ defined in (21) under Assumption 3 is given as follows.

Lemma 2

([15], Proposition 1,2,3,6). For a finite sequence $\{μ_{i}\} = \{μ_{1}, μ_{2}, \dots, μ_{p}\}$ with $μ_{i} = rank (G_{i})$ for $i \in [p]$ where $G_{i}$ is the observability matrix given in (4), the followings are equivalent:

(i)
The pair $(A, C)$ is $2 q$ redundant observable.

(ii)
The matrix Φ is ( $\{μ_{i}\}$ -stacked) $2 q$ -error detectable.

(iii)
For every set $J \subset [p]$ satisfying $| J | \geq p - 2 q$ , $Φ_{I_{J}^{\{μ_{i}\}}}$ has full column rank.

(iv)
The pair $(A, C)$ is observable under $q$ -sparse sensor attacks.

Note that the redundancy for observability is $2 q$ , which is twice the sparsity of the attack signal. This is the key point of constructing the state estimation algorithm. We can examine each subset $J_{k} \subset [p]$ of sensors whose size is $p - q$ . In other words, we have $(\binom{p}{q})$ number of subsets $J_{1}, J_{2}, \dots, J_{(\binom{p}{q})}$ where $J_{k} \subset [p]$ and $| J_{k} | = p - q$ for $k = 1, 2, \dots, (\binom{p}{q})$ . Since $Φ$ is ( $\{μ_{i}\}$ -stacked) $2 q$ -error detectable by Assumption 3 and Lemma 2.(ii), it easily follows that $Φ_{I_{J_{k}}^{\{μ_{i}\}}}$ is $q$ -error detectable for $J_{k}$ with $| J_{k} | = p - q$ . This means that, even after removing any $q$ sensors, the remaining outputs still have $q$ redundancy for observability. Therefore, the detection scheme of Theorem 3, which relies on the ( $\{μ_{i}\}$ -stacked) $q$ -error detectability of the coding matrix, can be applied for each subset $J_{k} \subset [p]$ satisfying $| J_{k} | = p - q$ .

The configuration of the secure state estimator, which replaces the information fusion center $D$ in Figure 1, is sketched in Figure 2, and its operation is described in Algorithm 2. Before explaining the operation, let $Ψ$ denote ${(Φ^{⊤} P^{- 1} Φ)}^{- 1} Φ^{⊤} P^{- 1}$ where $Φ$ and P are given in (21) and (23), respectively. Furthermore, the notation for a sub-matrix is slightly abused for simplicity. For example, $P_{J}$ , $Φ_{J}$ , and $Ψ_{J}$ denote

P_{I_{J}^{\{μ_{i}\}}, I_{J}^{\{μ_{i}\}}}, Φ_{I_{J}^{\{μ_{i}\}}}, and {(Φ_{I_{J}^{\{μ_{i}\}}}^{⊤} {(P_{I_{J}^{\{μ_{i}\}}, I_{J}^{\{μ_{i}\}}})}^{- 1} Φ_{I_{J}^{\{μ_{i}\}}})}^{- 1} Φ_{I_{J}^{\{μ_{i}\}}}^{⊤} {(P_{I_{J}^{\{μ_{i}\}}, I_{J}^{\{μ_{i}\}}})}^{- 1},

respectively, where $I_{J}^{\{μ_{i}\}} : = ⋃_{j \in J}^{} \{(\sum_{i = 1}^{j - 1} μ_{i}) + 1, (\sum_{i = 1}^{j - 1} μ_{i}) + 2, \dots, \sum_{i = 1}^{j} μ_{i}\}$ . Recall that $P_{I_{J}^{\{μ_{i}\}}, I_{J}^{\{μ_{i}\}}}$ denotes the matrix obtained from P by eliminating all $i$ -th rows and all $j$ -th columns such that $i \notin I_{J}^{\{μ_{i}\}}$ and $j \notin I_{J}^{\{μ_{i}\}}$ .

Configuration of the resilient estimation scheme with Gaussian disturbance/noise.

Initially, an attack-free index set $J^{*}$ , a state estimate $\hat{x}$ , a standardized residual’s norm g, and a fault alarm signal f, are set to $[p]$ , $Ψ \hat{z}$ , 0, and 0, respectively. The algorithm continually checks if there is any attack in the index set $J^{*}$ based on Algorithm 1. For the given index set $J^{*}$ , the algorithm essentially calculates the MVUE $\hat{x} = Ψ_{J^{*}} {\hat{z}}_{J^{*}}$ , the residual $r = {\hat{z}}_{J^{*}} - Φ_{J^{*}} \hat{x}$ , the standardized residual $ζ = {(P_{J^{*}} - Φ_{J^{*}} Ψ_{J^{*}} P_{J^{*}})}^{- \frac{1}{2}} r$ , and its 2-norm $g = ζ^{⊤} ζ$ only with the measurement and covariance data from the subset $J^{*} \subset [p]$ . Recall from Theorem 2 that, if $e_{j} = e_{I_{j}^{\{μ_{i}\}}} = 0_{μ_{j} \times 1}$ for all $j \in J^{*}$ , we have $r \sim N (0_{μ_{J^{*}} \times 1}, P_{J^{*}} - Φ_{J^{*}} Ψ_{J^{*}} P_{J^{*}})$ where $μ_{J^{*}} : = \sum_{j \in J^{*}} μ_{j} = |I_{J^{*}}^{\{μ_{i}\}}|$ , and thus, $g = ζ^{⊤} ζ$ is an observation from a random variable $g_{J^{*}}$ , which satisfies a $χ^{2}$ distribution with $μ_{J^{*}}$ DOF,

g_{J^{*}} \sim χ_{μ_{J^{*}}}^{2} .

(42)

Therefore, g is used to detect the presence of attack in the subset $J^{*}$ by the $χ^{2}$ test. We compare g with the threshold $Δ_{T H}^{J^{*}}$ , which is designed before running the algorithm and determines the probability of false alarm and the probability of error detection. If $g \leq Δ_{T H}^{J^{*}}$ , the index set $J^{*}$ is declared to be attack-free by setting $f = 0$ and the algorithm simply maintains the selected optimal index set $J^{*}$ . Otherwise, when g is greater than the threshold $Δ_{T H}^{J^{*}}$ , the attack detection alarm is triggered by setting $f = 1$ , and the algorithm starts the process of searching new attack-free index set.

Algorithm 2 Operation of the resilient estimation with Gaussian disturbance/noise

Input:

{\hat{z}}_{1}

{\hat{z}}_{2}

, ⋯,

{\hat{z}}_{p}

P_{1}

P_{12}

, ⋯,

P_{p (p - 1)}

P_{p}

Output:

J^{*}

\hat{x}

, g, f
Initialization:

J^{*} = [p]

\hat{x} = Ψ \hat{z}

g = 0

f = 0

1: while system (1) is running do
2:

\hat{x} = Ψ_{J^{*}} {\hat{z}}_{J^{*}}

r = {\hat{z}}_{J^{*}} - Φ_{J^{*}} \hat{x}

ζ = {(P_{J^{*}} - Φ_{J^{*}} Ψ_{J^{*}} P_{J^{*}})}^{- \frac{1}{2}} r

g = ζ^{⊤} ζ

6: if

g \leq Δ_{T H}^{J^{*}}

then
7:

f = 0

8: else if

g > Δ_{T H}^{J^{*}}

then
9:

f = 1

10: for

J \subset [p]

satisfying

| J | = p - q

do
11:

{\hat{x}}^{J} = Ψ_{J} {\hat{z}}_{J}

12:

r^{J} = {\hat{z}}_{J} - Φ_{J} {\hat{x}}^{J}

13:

ζ^{J} = {(P_{J} - Φ_{J} Ψ_{J} P_{J})}^{- \frac{1}{2}} r^{J}

14:

g^{J} = {ζ^{J}}^{⊤} ζ^{J}

15: end for
16:

J^{*} = \underset{\begin{matrix} J \subset [p] \\ | J | = p - q \end{matrix}}{arg max} p_{g_{J}} (g^{J})

17: end if
18: end while

Open in a new tab

In order to find a new attack-free index set and, consequently, to recover the state x from the new index set, we search all subsets $J_{k}$ ’s in $[p]$ whose size is $p - q$ . For a detailed explanation, let

\{J_{1}, J_{2}, \dots, J_{(\binom{p}{q})}\}

be the set $\{J \subset [p] : | J | = p - q\}$ . For each subset $J_{k}$ where $k \in [(\binom{p}{q})]$ , the computing module $C_{k}$ calculates the MVUE ${\hat{x}}^{J_{k}} = Ψ_{J_{k}} {\hat{z}}_{J_{k}}$ , the residual $r^{J_{k}} = {\hat{z}}_{J_{k}} - Φ_{J_{k}} {\hat{x}}^{J_{k}}$ , the standardized residual $ζ^{J_{k}} = {(P_{J_{k}} - Φ_{J_{k}} Ψ_{J_{k}} P_{J_{k}})}^{- \frac{1}{2}} r^{J_{k}}$ , and its 2-norm $g^{J_{k}} = {ζ^{J_{k}}}^{⊤} ζ^{J_{k}}$ only with the measurement and covariance data from the subset $J_{k}$ . Now, the new optimal subset $J^{*}$ is decided by the maximum likelihood (ML) decision rule with the values of $g^{J_{k}}$ ’s, and the selector $S$ chooses the optimal index set $J^{*}$ by the ML decision rule. To this end, we wish to distinguish between $(\binom{p}{q})$ hypotheses, $H_{1}, H_{2}, \dots, H_{(\binom{p}{q})}$ , which are given as follows:

H_{k} : the set J_{k} is attack - free, i . e ., e_{j} = e_{I_{j}^{\{μ_{i}\}}} = 0_{μ_{j} \times 1} for all j \in J_{k} .

Let us denote $g_{k}$ as a random variable such that $g^{J_{k}}$ is a single observation from $g_{k}$ , whereas $g_{J_{k}}$ denotes a random variable such that

g_{J_{k}} \sim χ_{μ_{J_{k}}}^{2}

with $μ_{J_{k}} : = \sum_{j \in J_{k}} μ_{j} = |I_{J_{k}}^{\{μ_{i}\}}|$ and $p_{g_{J_{k}}}$ is the PDF of the $χ_{μ_{J_{k}}}^{2}$ distribution. Note that, if the sensors indexed by $J_{k}$ are attack-free, then the random variable $g_{k}$ as well as $g_{J_{k}}$ follows the $χ^{2}$ distribution with $μ_{J_{k}}$ DOF. The ML decision rule choose the hypothesis $H_{k^{*}}$ and the corresponding optimal index set $J_{k^{*}}$ that maximize the likelihood $p_{g_{k}} (g^{J_{k}}; H_{k})$ , which is the PDF of $g_{k}$ being equal to the observation $g^{J_{k}}$ under the hypothesis $H_{k}$ (that is, under the condition that there is no attack signal in the measurements indexed by $J_{k}$ ). Therefore, we have

\begin{matrix} J^{*} = J_{k^{*}} & = \underset{k \in [(\binom{p}{q})]}{arg max} p_{g_{k}} (g^{J_{k}}; H_{k}) = \underset{\begin{matrix} J \subset [p] \\ | J | = p - q \end{matrix}}{arg max} p_{g_{J}} (g^{J}), \end{matrix}

where the last equality comes from the fact that $g_{k} \sim χ_{μ_{J_{k}}}^{2}$ under the hypothesis $H_{k}$ so that it follows the PDF of the $χ^{2}$ distribution. Therefore, from the index set $J_{k^{*}}$ corresponding to the ML hypothesis $H_{k^{*}}$ , the MVUE of the newly selected optimal index set $J^{*} (= J_{k^{*}})$ , ${\hat{x}}^{J^{*}}$ , becomes the final suboptimal estimate of x.

Remark 2.

The proposed algorithm selects the subset of sensors $J^{*} \subset [p]$ , which is most likely to be attack-free with $| J^{*} | = p - q$ . Moreover, if the selected set $J^{*}$ is actually attack-free, it gives the minimum variance with unbiased estimation. In short, Algorithm 2 generates a state estimate, which is most likely to have minimum variance with unbiased mean. However, we say that it is a suboptimal estimate of x instead of the optimal estimate because the decentralized multi-sensor information fusion Kalman filter cannot ensure to achieve the centralized optimal covariance even without attack as illustrated in ([24], Section 5).

Remark 3.

Note that Algorithm 2 needs to prepare $(\binom{p}{q})$ candidates and compare all those candidates. The time complexity of the error correction algorithm depends on the number of combinations $(\binom{p}{q})$ , and thus, it has the polynomial time complexity of $O (p^{min {q, p - q}})$ . Therefore, the proposed algorithm may not be scalable for very large p with $q \approx p / 2$ due to the combinatorial nature of the algorithm. The time complexity could be reduced by imposing additional restrictive assumptions as done in [20,21] which reformulate the problem into a convex optimization problem. However, in our scheme demanding minimal assumptions, the comibinatorial algorithm only needs to operate when an attack is detected. In addition, most of the time, only the attack detection algorithm requiring a small amount of computation, is executed. Another advantage of the proposed algorithm is that its space complexity is linear with the number of sensors p, that is, $O (p)$ . The total memory size required for local observers is $\sum_{i = 1}^{p} μ_{i} \leq n p$ , whereas if all possible combinations of estimator candidates are configured as real observers, the observer’s size becomes $n (\binom{p}{q})$ .

5. Simulation Results

We consider a motor-controlled multi-DOF torsion system [34] as depicted in Figure 3. A continuous-time state-space model of the system when the control input is the torque $τ$ ( $N \cdot m$ ) generated by the servo motor is given by

\begin{matrix} P_{c}^{'} : \{\begin{matrix} \dot{x} (t) = A_{c}^{'} x (t) + B_{c}^{'} τ (t) + d (t) \\ y (t) = C_{c} x (t) + n (t) \end{matrix} \end{matrix}

(43)

with the matrices

\begin{matrix} \begin{matrix} A_{c}^{'} & = [\begin{matrix} 0 & 1 & 0 & 0 & 0 & 0 \\ - \frac{k_{1}}{J_{1}} & - \frac{b_{1}}{J_{1}} & \frac{k_{1}}{J_{1}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ \frac{k_{1}}{J_{2}} & 0 & - \frac{k_{1} + k_{2}}{J_{2}} & - \frac{b_{2}}{J_{2}} & \frac{k_{2}}{J_{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \\ 0 & 0 & \frac{k_{2}}{J_{3}} & 0 & - \frac{k_{2}}{J_{3}} & - \frac{b_{3}}{J_{3}} \end{matrix}], B_{c}^{'} = [\begin{matrix} 0 \\ \frac{1}{J_{1}} \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}], \\ C_{c} & = [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & - 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & - 1 & 0 \end{matrix}], \end{matrix} \end{matrix}

(44)

where

\begin{matrix} x : = [\begin{matrix} θ_{1} \\ {\dot{θ}}_{1} \\ θ_{2} \\ {\dot{θ}}_{2} \\ θ_{3} \\ {\dot{θ}}_{3} \end{matrix}] and y : = [\begin{matrix} θ_{1} \\ θ_{2} \\ θ_{3} \\ θ_{1} - θ_{2} \\ θ_{2} - θ_{3} \end{matrix}] \end{matrix}

are the state variable and the output measurement, respectively. Here, the unit for angular positions $θ$ ’s and the unit for angular velocities $\dot{θ}$ ’s are ( $rad$ ) and ( $rad / s$ ), respectively. The parameters are borrowed from [34], and we have that $J_{1} = 0.0022$ , $J_{2} = J_{3} = 0.000545$ ( $kg \cdot m^{2}$ ) for the moment of inertia, $b_{1} = 0.015$ , $b_{2} = b_{3} = 0.0015$ ( $N \cdot m / (rad / s)$ ) for the viscous damping ratio, and $k_{1} = k_{2} = 1$ ( $N \cdot m / rad)$ for the flexible stiffness.

Motor control system of multi-DOF torsion modules.

Note that the dynamics are the same as those of the three inertia system considered in [15]; however, Figure 3 additionally considers the servo motor system given as follows:

\begin{matrix} \begin{matrix} τ = \frac{η_{g} K_{g} η_{m} k_{t} (u - K_{g} k_{m} {\dot{θ}}_{1})}{R_{m}}, \end{matrix} \end{matrix}

(45)

which generates the torque $τ$ ( $N \cdot m$ ) from the input voltage of u ( $V$ ). The parameters for the servo system are also borrowed from [34], and we have that $η_{g} = 0.9$ for the gearbox efficiency, $K_{g} = 70$ for the total gear ratio, $η_{m} = 0.69$ for the motor efficiency, $k_{t} = 0.00768$ ( $N \cdot m / A$ ) for the motor current torque constant, $k_{m} = 0.00768$ ( $V / (rad / s)$ ) for the motor back electromotive force (EMF) constant, and $R_{m} = 2.6$ ( $Ω$ ) for the motor armature resistance. Thus, the final continuous-time plant with the voltage u ( $V$ ) as an input signal is obtained as

\begin{matrix} P_{c} : \{\begin{matrix} \dot{x} (t) = A_{c} x (t) + B_{c} u (t) + d (t) \\ y (t) = C_{c} x (t) + n (t) \end{matrix} \end{matrix}

(46)

with the matrices

\begin{matrix} \begin{matrix} A_{c} & = [\begin{matrix} 0 & 1 & 0 & 0 & 0 & 0 \\ - \frac{k_{1}}{J_{1}} & - \frac{b_{1}}{J_{1}} - \frac{η_{g} K_{g}^{2} η_{m} k_{t} k_{m}}{R_{m}} \frac{1}{J_{1}} & \frac{k_{1}}{J_{1}} & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ \frac{k_{1}}{J_{2}} & 0 & - \frac{k_{1} + k_{2}}{J_{2}} & - \frac{b_{2}}{J_{2}} & \frac{k_{2}}{J_{2}} & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \\ 0 & 0 & \frac{k_{2}}{J_{3}} & 0 & - \frac{k_{2}}{J_{3}} & - \frac{b_{3}}{J_{3}} \end{matrix}], B_{c} = [\begin{matrix} 0 \\ \frac{η_{g} K_{g} η_{m} k_{t}}{R_{m}} \frac{1}{J_{1}} \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}], \end{matrix} \end{matrix}

(47)

and the same $C_{c}$ as in (44). Finally, the zero-order hold equivalent model of (46) is used for the discrete-time model $P$ in (1), and the matrices are calculated by

\begin{matrix} A : = e^{A_{c} T_{s}}, B : = (\int_{0}^{T_{s}} e^{A_{c} τ} d τ) B_{c}, C : = C_{c} \end{matrix}

(48)

with the sampling time of $T_{s} = 0.002$ ( $s$ ). By examining all possible combinations of sensors, it follows that the system $P$ in (1) with A and C given in (48) is 2-redundant observable, and hence it is observable under 1-sparse sensor attack by Lemma 2.

In addition, the disturbance d and the noise n are assumed to satisfy Assumption 1 with

\begin{matrix} \begin{matrix} Q & = 0 . 001^{2} \times [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 9 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \end{matrix}], R = 0 . 001^{2} \times [\begin{matrix} 1 & 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & - 1 & 1 \\ 0 & 0 & 1 & 0 & - 1 \\ 1 & - 1 & 0 & 3 & - 1 \\ 0 & 1 & - 1 & - 1 & 3 \end{matrix}], \end{matrix} \end{matrix}

and the initial state $x (0)$ of the system (46) satisfies $x (0) \sim N ({\bar{x}}_{0}, P_{0})$ as stated in Assumption 1 with the mean ${\bar{x}}_{0}$ and the covariance $P_{0}$ given by

\begin{matrix} \begin{matrix} {\bar{x}}_{0} & = [\begin{matrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}], P_{0} = [\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \end{matrix}] . \end{matrix} \end{matrix}

The simulation is performed under 1-sparse sensor attack on the third sensor with the signal shown in Figure 4b, which is made to mimic the motion pattern by the natural frequency as observed in Figure 4c,d. Moreover, the attack starts at 2 second, which is the same time when the square pulse input u is injected into the system as described in Figure 4a. Even under the attack signal, the resilient state estimation with multi-sensor information fusion Kalman filter based on the observability decomposition developed in Section 3 and Section 4 works well. The states are recovered with a small error as demonstrated in Figure 4c,d, which are the state estimation results for $θ_{3}$ and ${\dot{θ}}_{3}$ , respectively.

Plot of signals in a multi-DOF torsion system.

In this simulation, the threshold $Δ_{T H}$ for the attack detection is chosen by $δ = 0.05$ in (41) so that the cumulative density function (CDF) satisfies $\int_{0}^{Δ_{T H}} p_{g_{J^{*}}} (x) d x = 0.95$ where $p_{g_{J^{*}}}$ is the PDF of a random variable $g_{J^{*}}$ , which satisfies a $χ^{2}$ distribution with $μ_{J^{*}}$ DOF, as stated in (42). Since Figure 4e shows that the 2-norm of the standardized residual, g, exceeds the threshold $Δ_{T H}$ at the instant of 2 second, which is the initiation time of the attack, it is judged that there is an attack (the lines from 8 to 9 in Algorithm 2) and the estimation scheme begins to search the indices of attack-free sensors (the lines from 10 to 16 in Algorithm 2). As a result of the search algorithm, a new set of sensor indices is found by the ML decision rule (the line 16 in Algorithm 2), and the attacked third senor is excluded from 2 second as depicted in Figure 4f.

6. Conclusions

In this paper, the multi-sensor information fusion Kalman filter proposed in [24,25] was improved using the observability decomposition to ensure the convergence of the error covariance matrix of each local observer. The local observer of a decentralized Kalman filter with only a single sensor was designed for an observable subspace instead of the entire n-dimensional state vector without any global information. Then, the proposed decentralized information fusion Kalman filter was applied to the secure state estimation problem where some of sensors were compromised by a malicious attacker.

To cope with the zero-mean Gaussian distributed disturbances/noises, a local Kalman filter replaced the partial Luenberger observer designed in [15], where bounded disturbances/noises were considered in the state estimation problem under sparse sensor attacks. When there was no attack, the proposed algorithm guaranteed an optimal state estimate in the sense of minimum variance, and it generated a state estimate that was most likely to have the minimum variance with an unbiased mean in the presence of sparse sensor attacks.

The proposed algorithm can be applied to cyber-physical systems, including complex sensor networks operating based on linear dynamics under sparse sensor attacks as well as Gaussian disturbances/noises. We imposed the minimal assumption of the redundant observability, which is known to be the equivalent condition to solve the problem. Furthermore, the computational time was alleviated by running only a relatively light attack detection scheme for most of the execution time, and the memory size of the observer was reduced by constructing local observers only for observable subspaces.

One possible direction of future research is to develop a distributed attack-resilient state estimator. While this paper proposed a decentralized Kalman filter scheme, the fusion center collects all the data from each sensors. Although the construction of local Kalman filters is decentralized, the information fusion method is still centralized. Therefore, it is necessary to develop a fully distributed attack-resilient state estimation technique for a general sensor network without any central information fusion center.

Abbreviations

The following abbreviations are used in this manuscript:

LTI	Linear Time Invariant
i.i.d.	independent and identically distributed
MVUE	Minimum Variance Unbiased Estimator
BLUE	Best Linear Unbiased Estimator
PDF	Probability Density Function
DOF	Degrees Of Freedom
ML	Maximum Likelihood
EMF	ElectroMotive Force
CDF	Cumulative Density Function

Open in a new tab

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This work was supported by the Materials & Components Technology Development Program (20017351, Development of Servo System Technology with a Current Response of 6.2 kHz and Power Regeneration for Automated Manufacturing Equipment Application) funded by the Ministry of Trade, Industry & Energy (MOTIE, Korea).

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Pasqualetti F., Dörfler F., Bullo F. Attack detection and identification in cyber-physical systems. IEEE Trans. Autom. Control. 2013;58:2715–2729. doi: 10.1109/TAC.2013.2266831. [DOI] [Google Scholar]
2.Sandberg H., Amin S., Johansson K.H. Cyberphysical security in networked control systems: An introduction to the issue. IEEE Control Syst. Mag. 2015;35:20–23. [Google Scholar]
3.Teixeira A., Shames I., Sandberg H., Johansson K.H. A secure control framework for resource-limited adversaries. Automatica. 2015;51:135–148. doi: 10.1016/j.automatica.2014.10.067. [DOI] [Google Scholar]
4.Zhang X., Zhu F., Zhang J., Liu T. Attack isolation and location for a complex network cyber-physical system via zonotope theory. Neurocomputing. 2022;469:239–250. doi: 10.1016/j.neucom.2021.10.070. [DOI] [Google Scholar]
5.Langner R. Stuxnet: Dissecting a cyberwarfare weapon. IEEE Secur. Priv. 2011;9:49–51. doi: 10.1109/MSP.2011.67. [DOI] [Google Scholar]
6.Wright A. Hacking cars. Commun. ACM. 2011;54:18–19. doi: 10.1145/2018396.2018403. [DOI] [Google Scholar]
7.Ten C.-W., Liu C.-C., Manimaran G. Vulnerability assessment of cybersecurity for SCADA systems. IEEE Trans. Power Syst. 2008;23:1836–1846. doi: 10.1109/TPWRS.2008.2002298. [DOI] [Google Scholar]
8.Dutta A., Langbort C. Confiscating flight control system by stealthy output injection attack. J. Aerosp. Inf. Syst. 2017;14:203–213. doi: 10.2514/1.I010494. [DOI] [Google Scholar]
9.Liu Y., Ning P., Reiter M.K. False data injection attacks against state estimation in electric power grids. ACM Trans. Inf. Syst. Secur. 2011;14:13:1–13:33. doi: 10.1145/1952982.1952995. [DOI] [Google Scholar]
10.Fawzi H., Tabuada P., Diggavi S. Secure estimation and control for cyber-physical systems under adversarial attacks. IEEE Trans. Autom. Control. 2014;59:1454–1467. doi: 10.1109/TAC.2014.2303233. [DOI] [Google Scholar]
11.Chen Y., Kar S., Moura J.M.F. Cyber-physical systems: Dynamic sensor attacks and strong observability; Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing; Brisbane, Australia. 19–24 April 2015; pp. 1752–1756. [Google Scholar]
12.Shoukry Y., Tabuada P. Event-triggered state observers for sparse sensor noise/attacks. IEEE Trans. Autom. Control. 2016;61:2079–2091. doi: 10.1109/TAC.2015.2492159. [DOI] [Google Scholar]
13.Shoukry Y., Nuzzo P., Puggelli A., Sangiovanni-Vincentelli A.L., Seshiz S.A., Tabuada P. Secure state estimation for cyber physical systems under sensor attacks: A satisfiability modulo theory approach. IEEE Trans. Autom. Control. 2017;62:4917–4932. doi: 10.1109/TAC.2017.2676679. [DOI] [Google Scholar]
14.An L., Yang G.-H. State estimation under sparse sensor attacks: A constrained set partitioning approach. IEEE Trans. Autom. Control. 2019;64:3861–3868. doi: 10.1109/TAC.2018.2885063. [DOI] [Google Scholar]
15.Lee C., Shim H., Eun Y. On redundant observability: From security index to attack detection and resilient state estimation. IEEE Trans. Autom. Control. 2019;64:775–782. doi: 10.1109/TAC.2018.2837107. [DOI] [Google Scholar]
16.Candès E.J., Tao T. Decoding by linear programming. IEEE Trans. Inf. Theory. 2005;51:4203–4215. doi: 10.1109/TIT.2005.858979. [DOI] [Google Scholar]
17.Donoho D.L. Compressed sensing. IEEE Trans. Inf. Theory. 2006;52:1289–1306. doi: 10.1109/TIT.2006.871582. [DOI] [Google Scholar]
18.Pajic M., Lee I., Pappas G.J. Attack-resilient state estimation for noisy dynamical systems. IEEE Trans. Control Netw. Syst. 2017;4:82–92. doi: 10.1109/TCNS.2016.2607420. [DOI] [Google Scholar]
19.Mishra S., Shoukry Y., Karamchandani N., Diggavi S., Tabuada P. Secure state estimation against sensor attacks in the presence of noise. IEEE Trans. Control Netw. Syst. 2017;4:49–59. doi: 10.1109/TCNS.2016.2606880. [DOI] [Google Scholar]
20.Chang Y.H., Hu Q., Tomlin C.J. Secure estimation based Kalman filter for cyber-physical systems against sensor attacks. Automatica. 2018;95:399–412. doi: 10.1016/j.automatica.2018.06.010. [DOI] [Google Scholar]
21.Liu X., Mo Y., Garone E. Local decomposition of Kalman filters and its application for secure state estimation. IEEE Trans. Autom. Control. 2021;66:5037–5044. doi: 10.1109/TAC.2020.3044854. [DOI] [Google Scholar]
22.Mehra R.K., Peschon J. An innovations approach to fault detection and diagnosis in dynamic systems. Automatica. 1971;7:637–640. doi: 10.1016/0005-1098(71)90028-8. [DOI] [Google Scholar]
23.Brumback B., Srinath M. A chi-square test for fault-detection in Kalman filters. IEEE Trans. Autom. Control. 1987;32:552–554. doi: 10.1109/TAC.1987.1104658. [DOI] [Google Scholar]
24.Sun S.-L., Deng Z.-L. Multi-sensor optimal information fusion Kalman filter. Automatica. 2004;40:1017–1023. doi: 10.1016/j.automatica.2004.01.014. [DOI] [Google Scholar]
25.Sun S.-L. Multi-sensor optimal information fusion Kalman filters with applications. Aerosp. Sci. Technol. 2004;8:57–62. doi: 10.1016/j.ast.2003.08.003. [DOI] [Google Scholar]
26.Kim J., Shim H., Wu J. On distributed optimal Kalman-Bucy filtering by averaging dynamics of heterogeneous agents; Proceedings of the 55th IEEE Conference on Decision and Control; Las Vegas, NV, USA. 12–14 December 2016; pp. 6309–6314. [Google Scholar]
27.Kim T., Lee C., Shim H. Completely decentralized design of distributed observer for linear systems. IEEE Trans. Autom. Control. 2020;65:4664–4678. doi: 10.1109/TAC.2019.2962360. [DOI] [Google Scholar]
28.Lee C. Ph.D. Dissertation. Seoul National University; Seoul, Korea: 2018. Attack-Resilient Feedback Control Systems: Secure State Estimation under Sensor Attacks. [Google Scholar]
29.Simon D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches. Wiley-Interscience; Hoboken, NJ, USA: 2006. [Google Scholar]
30.Kay S.M. Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory. Prentice Hall PTR; Upper Saddle River, NJ, USA: 1993. [Google Scholar]
31.Kay S.M. Fundamentals of Statistical Signal Processing, Volume II: Detection Theory. Prentice Hall PTR; Upper Saddle River, NJ, USA: 1993. [Google Scholar]
32.Boyd S., Vandenberghe L. Convex Optimization. Cambridge University Press; Cambridge, UK: 2004. [Google Scholar]
33.Zhou K., Doyle J.C. Essentials of Robust Control. Prentice Hall; Upper Saddle River, NJ, USA: 1998. [Google Scholar]
34.Quanser Inc. Multi-DOF Torsion Experiment User Manual. Quanser Inc.; Markham, ON, Canada: 2012. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Not applicable.

[B1-sensors-22-06909] 1.Pasqualetti F., Dörfler F., Bullo F. Attack detection and identification in cyber-physical systems. IEEE Trans. Autom. Control. 2013;58:2715–2729. doi: 10.1109/TAC.2013.2266831. [DOI] [Google Scholar]

[B2-sensors-22-06909] 2.Sandberg H., Amin S., Johansson K.H. Cyberphysical security in networked control systems: An introduction to the issue. IEEE Control Syst. Mag. 2015;35:20–23. [Google Scholar]

[B3-sensors-22-06909] 3.Teixeira A., Shames I., Sandberg H., Johansson K.H. A secure control framework for resource-limited adversaries. Automatica. 2015;51:135–148. doi: 10.1016/j.automatica.2014.10.067. [DOI] [Google Scholar]

[B4-sensors-22-06909] 4.Zhang X., Zhu F., Zhang J., Liu T. Attack isolation and location for a complex network cyber-physical system via zonotope theory. Neurocomputing. 2022;469:239–250. doi: 10.1016/j.neucom.2021.10.070. [DOI] [Google Scholar]

[B5-sensors-22-06909] 5.Langner R. Stuxnet: Dissecting a cyberwarfare weapon. IEEE Secur. Priv. 2011;9:49–51. doi: 10.1109/MSP.2011.67. [DOI] [Google Scholar]

[B6-sensors-22-06909] 6.Wright A. Hacking cars. Commun. ACM. 2011;54:18–19. doi: 10.1145/2018396.2018403. [DOI] [Google Scholar]

[B7-sensors-22-06909] 7.Ten C.-W., Liu C.-C., Manimaran G. Vulnerability assessment of cybersecurity for SCADA systems. IEEE Trans. Power Syst. 2008;23:1836–1846. doi: 10.1109/TPWRS.2008.2002298. [DOI] [Google Scholar]

[B8-sensors-22-06909] 8.Dutta A., Langbort C. Confiscating flight control system by stealthy output injection attack. J. Aerosp. Inf. Syst. 2017;14:203–213. doi: 10.2514/1.I010494. [DOI] [Google Scholar]

[B9-sensors-22-06909] 9.Liu Y., Ning P., Reiter M.K. False data injection attacks against state estimation in electric power grids. ACM Trans. Inf. Syst. Secur. 2011;14:13:1–13:33. doi: 10.1145/1952982.1952995. [DOI] [Google Scholar]

[B10-sensors-22-06909] 10.Fawzi H., Tabuada P., Diggavi S. Secure estimation and control for cyber-physical systems under adversarial attacks. IEEE Trans. Autom. Control. 2014;59:1454–1467. doi: 10.1109/TAC.2014.2303233. [DOI] [Google Scholar]

[B11-sensors-22-06909] 11.Chen Y., Kar S., Moura J.M.F. Cyber-physical systems: Dynamic sensor attacks and strong observability; Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing; Brisbane, Australia. 19–24 April 2015; pp. 1752–1756. [Google Scholar]

[B12-sensors-22-06909] 12.Shoukry Y., Tabuada P. Event-triggered state observers for sparse sensor noise/attacks. IEEE Trans. Autom. Control. 2016;61:2079–2091. doi: 10.1109/TAC.2015.2492159. [DOI] [Google Scholar]

[B13-sensors-22-06909] 13.Shoukry Y., Nuzzo P., Puggelli A., Sangiovanni-Vincentelli A.L., Seshiz S.A., Tabuada P. Secure state estimation for cyber physical systems under sensor attacks: A satisfiability modulo theory approach. IEEE Trans. Autom. Control. 2017;62:4917–4932. doi: 10.1109/TAC.2017.2676679. [DOI] [Google Scholar]

[B14-sensors-22-06909] 14.An L., Yang G.-H. State estimation under sparse sensor attacks: A constrained set partitioning approach. IEEE Trans. Autom. Control. 2019;64:3861–3868. doi: 10.1109/TAC.2018.2885063. [DOI] [Google Scholar]

[B15-sensors-22-06909] 15.Lee C., Shim H., Eun Y. On redundant observability: From security index to attack detection and resilient state estimation. IEEE Trans. Autom. Control. 2019;64:775–782. doi: 10.1109/TAC.2018.2837107. [DOI] [Google Scholar]

[B16-sensors-22-06909] 16.Candès E.J., Tao T. Decoding by linear programming. IEEE Trans. Inf. Theory. 2005;51:4203–4215. doi: 10.1109/TIT.2005.858979. [DOI] [Google Scholar]

[B17-sensors-22-06909] 17.Donoho D.L. Compressed sensing. IEEE Trans. Inf. Theory. 2006;52:1289–1306. doi: 10.1109/TIT.2006.871582. [DOI] [Google Scholar]

[B18-sensors-22-06909] 18.Pajic M., Lee I., Pappas G.J. Attack-resilient state estimation for noisy dynamical systems. IEEE Trans. Control Netw. Syst. 2017;4:82–92. doi: 10.1109/TCNS.2016.2607420. [DOI] [Google Scholar]

[B19-sensors-22-06909] 19.Mishra S., Shoukry Y., Karamchandani N., Diggavi S., Tabuada P. Secure state estimation against sensor attacks in the presence of noise. IEEE Trans. Control Netw. Syst. 2017;4:49–59. doi: 10.1109/TCNS.2016.2606880. [DOI] [Google Scholar]

[B20-sensors-22-06909] 20.Chang Y.H., Hu Q., Tomlin C.J. Secure estimation based Kalman filter for cyber-physical systems against sensor attacks. Automatica. 2018;95:399–412. doi: 10.1016/j.automatica.2018.06.010. [DOI] [Google Scholar]

[B21-sensors-22-06909] 21.Liu X., Mo Y., Garone E. Local decomposition of Kalman filters and its application for secure state estimation. IEEE Trans. Autom. Control. 2021;66:5037–5044. doi: 10.1109/TAC.2020.3044854. [DOI] [Google Scholar]

[B22-sensors-22-06909] 22.Mehra R.K., Peschon J. An innovations approach to fault detection and diagnosis in dynamic systems. Automatica. 1971;7:637–640. doi: 10.1016/0005-1098(71)90028-8. [DOI] [Google Scholar]

[B23-sensors-22-06909] 23.Brumback B., Srinath M. A chi-square test for fault-detection in Kalman filters. IEEE Trans. Autom. Control. 1987;32:552–554. doi: 10.1109/TAC.1987.1104658. [DOI] [Google Scholar]

[B24-sensors-22-06909] 24.Sun S.-L., Deng Z.-L. Multi-sensor optimal information fusion Kalman filter. Automatica. 2004;40:1017–1023. doi: 10.1016/j.automatica.2004.01.014. [DOI] [Google Scholar]

[B25-sensors-22-06909] 25.Sun S.-L. Multi-sensor optimal information fusion Kalman filters with applications. Aerosp. Sci. Technol. 2004;8:57–62. doi: 10.1016/j.ast.2003.08.003. [DOI] [Google Scholar]

[B26-sensors-22-06909] 26.Kim J., Shim H., Wu J. On distributed optimal Kalman-Bucy filtering by averaging dynamics of heterogeneous agents; Proceedings of the 55th IEEE Conference on Decision and Control; Las Vegas, NV, USA. 12–14 December 2016; pp. 6309–6314. [Google Scholar]

[B27-sensors-22-06909] 27.Kim T., Lee C., Shim H. Completely decentralized design of distributed observer for linear systems. IEEE Trans. Autom. Control. 2020;65:4664–4678. doi: 10.1109/TAC.2019.2962360. [DOI] [Google Scholar]

[B28-sensors-22-06909] 28.Lee C. Ph.D. Dissertation. Seoul National University; Seoul, Korea: 2018. Attack-Resilient Feedback Control Systems: Secure State Estimation under Sensor Attacks. [Google Scholar]

[B29-sensors-22-06909] 29.Simon D. Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches. Wiley-Interscience; Hoboken, NJ, USA: 2006. [Google Scholar]

[B30-sensors-22-06909] 30.Kay S.M. Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory. Prentice Hall PTR; Upper Saddle River, NJ, USA: 1993. [Google Scholar]

[B31-sensors-22-06909] 31.Kay S.M. Fundamentals of Statistical Signal Processing, Volume II: Detection Theory. Prentice Hall PTR; Upper Saddle River, NJ, USA: 1993. [Google Scholar]

[B32-sensors-22-06909] 32.Boyd S., Vandenberghe L. Convex Optimization. Cambridge University Press; Cambridge, UK: 2004. [Google Scholar]

[B33-sensors-22-06909] 33.Zhou K., Doyle J.C. Essentials of Robust Control. Prentice Hall; Upper Saddle River, NJ, USA: 1998. [Google Scholar]

[B34-sensors-22-06909] 34.Quanser Inc. Multi-DOF Torsion Experiment User Manual. Quanser Inc.; Markham, ON, Canada: 2012. [Google Scholar]

PERMALINK

Observability Decomposition-Based Decentralized Kalman Filter and Its Application to Resilient State Estimation under Sensor Attacks

Chanhwa Lee

Roles

Abstract

1. Introduction

2. System Modeling and Problem Formulation

2.1. Plant Modeling with Gaussian Disturbances and Noises

Assumption 1.

2.2. Attack Modeling with Sparse Sensor Attacks

Assumption 2.

2.3. Problem Formulation

Assumption 3.

3. Optimal Information Fusion Kalman Filter Based on Observability Decomposition

3.1. Kalman Observability Decomposition with Single Sensor

3.2. Decentralized Multi-Sensor Kalman Filter

3.3. Optimal Information Fusion Based on Observability Decomposition

Figure 1.

Theorem 1

Proof.

Remark 1.

4. Attack Resilient and Secure State Estimation by Decentralized Kalman Filter

4.1. Effect of Sparse Sensor Attack on Information Fusion Kalman FIlter

4.2. Detection of Sparse Sensor Attack

Definition 1

Lemma 1

Theorem 2.

Proof.

Theorem 3.

Proof.

4.3. Secure State Estimation under a Sparse Sensor Attack

Lemma 2

Figure 2.

Remark 2.

Remark 3.

5. Simulation Results

Figure 3.

Figure 4.

6. Conclusions

Abbreviations

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Funding Statement

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases