Author manuscript; available in PMC: 2020 Feb 1.
Published in final edited form as: IEEE Trans Aerosp Electron Syst. 2018 Jun 25;55(1):493–498. doi: 10.1109/TAES.2018.2850379

Information Formulation of the UDU Kalman Filter

Christopher D’Souza 1, Renato Zanetti 2
PMCID: PMC6443377  NIHMSID: NIHMS1521718  PMID: 30948859

Abstract

A new information formulation of the Kalman filter is presented in which the information matrix is parameterized as the product of an upper triangular matrix, a diagonal matrix, and the transpose of the triangular matrix (UDU factorization). The UDU factorization of the Kalman filter is known for its numerical stability; this work extends the technique to the information filter. A distinct characteristic of the new algorithm is that measurements can be processed as vectors, while the classic UDU factorization requires scalar measurement processing, i.e., a diagonal measurement noise covariance matrix.

I. Introduction

The UDU formulation of the Kalman filter has been used in aerospace engineering applications for several decades. Thornton [1], Bierman and Thornton [2], and Bierman [3] introduced an elegant formulation where the covariance matrix $P$ is replaced by two factors: a diagonal matrix $D$ and an upper triangular matrix $U$ with ones on the main diagonal, such that $P = UDU^T$. The UDU factorization improves the computational stability and efficiency of large navigation filters. Although it was originally used in a batch formulation [4], it lends itself to sequential implementations, well suited to platforms where both computational stability and numerical efficiency are at a premium. It serves as the backbone of the Orion navigation system [5].

Factorization of the covariance matrix in a Kalman filter [6] is almost as old as the filter itself. In 1963 James Potter developed a square-root formulation of the Kalman filter to implement on the Apollo onboard computer [7]. The main driver at the time was numerical precision, as computer words were only 8 bits long. Replacing the covariance by a square-root matrix $S$, such that $P = SS^T$, reduces the spread of the elements of $P$, bringing them closer to 1 and doubling the numerical precision of the stored variable. Potter's algorithm requires the computation of scalar square roots (one per measurement). At the time, the Apollo Kalman filter was designed without any process noise, because the computations required to include it were too burdensome [8]. A very desirable by-product of this factorization is that the symmetry and semi-positive definiteness of the covariance are ensured by construction, and do not need to be checked or enforced to correct for numerical and round-off errors. It should be noted that this Apollo factorization was not a triangular square-root matrix.

An alternative square root covariance factorization is the Cholesky factorization [9], [10]. The Cholesky method is very similar to Potter’s but computes the square root of the covariance matrix with a Cholesky decomposition (S is a triangular matrix) [11]. Another relevant covariance factorization work is that proposed by Oshman and Bar-Itzhack [12], which utilizes the spectral decomposition of the covariance matrix.

The UDU factorization is not a square-root filter; the numerical precision of the stored variable does not increase due to the factorization. For example, if $P$ is diagonal, $U = I$ and $D = P$; therefore the full range of values in $P$ is preserved in this factorization. However, the UDU formulation of the Kalman filter has excellent numerical stability properties [3]; it ensures symmetry of the covariance by construction, and it requires only a trivial check and correction to ensure semi-positive definiteness (it suffices to enforce that the diagonal elements of $D$ remain non-negative). The UDU formulation is free from square-root operations, making it computationally cheaper than the Cholesky approach. For these reasons the UDU has endured as one of the preferred practical implementations of Kalman filters in aerospace applications.

While the UDU factorization is well known, it has never been applied to the information formulation of the Kalman filter [13], [14], [15]. In this formulation the inverse of the covariance matrix, known as the information matrix, is carried in the recursive algorithm rather than the covariance matrix itself. The information formulation is a popular approach in several situations. In particular, the Square Root Information Filter (SRIF) [16], [17], [18], [3], [19], [20] is a go-to Kalman filter factorization method used in orbit determination packages such as Monte because of its great stability and accuracy. In this work we introduce the UDU information filter, a previously undeveloped algorithm with two key properties that make it a very desirable implementation of a recursive estimator: (i) unlike the regular UDU filter, measurements do not need to be processed as scalars, i.e., the measurement noise covariance matrix $R$ does not need to be diagonal or diagonalized, and (ii) unlike the regular information formulation, the state estimation error covariance matrix does not actually need to be inverted.

II. Background

The well-known Kalman filter measurement update equations are given by

$\hat{x}_k = \bar{x}_k + K_k\left(y_k - H_k\bar{x}_k\right)$ (1)
$P_k = \bar{P}_k - \bar{P}_k H_k^T\left(H_k\bar{P}_k H_k^T + R_k\right)^{-1}H_k\bar{P}_k = \left(I - K_kH_k\right)\bar{P}_k$ (2)
$K_k = \bar{P}_k H_k^T\left(H_k\bar{P}_k H_k^T + R_k\right)^{-1} = P_k H_k^T R_k^{-1}$ (3)

where the bar represents the a priori value, $K_k$ is the $n \times m$ Kalman gain, $x_k \in \mathbb{R}^n$ is the state vector, $P_k$ is the $n \times n$ estimation error covariance matrix, and $y_k \in \mathbb{R}^m$ is the measurement vector, defined as

$y_k = H_k x_k + \eta_k$ (4)

where ηk is a zero mean, white sequence with covariance matrix Rk. The propagation equations are

$\bar{x}_{k+1} = \Phi(t_{k+1}, t_k)\hat{x}_k$ (5)
$\bar{P}_{k+1} = \Phi(t_{k+1}, t_k)P_k\Phi(t_{k+1}, t_k)^T + G_kQ_kG_k^T = \Phi_kP_k\Phi_k^T + G_kQ_kG_k^T$ (6)

where $\Phi(t_{k+1}, t_k)$ (which we will denote as $\Phi_k$) is the $n \times n$ state transition matrix from $t_k$ to $t_{k+1}$, $Q_k$ is the $p \times p$ process noise covariance matrix, and $G_k$ is the $n \times p$ process noise shaping matrix.

The UDU factorization implements the above equations by replacing the covariance matrix Pk with an upper triangular matrix with ones on the diagonal (Uk) and a diagonal matrix Dk, such that

$P_k = U_kD_kU_k^T$ (7)
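As an illustration, the factors in Eq. (7) can be computed with a short backward recursion over the columns of $P_k$. The following sketch is ours (function name and test matrix are illustrative, not from the paper):

```python
import numpy as np

def udu(P):
    """Factor symmetric positive-definite P as P = U @ diag(d) @ U.T,
    with U unit upper triangular and d the diagonal factor."""
    P = np.asarray(P, dtype=float)
    n = P.shape[0]
    U = np.eye(n)
    d = np.zeros(n)
    for j in range(n - 1, -1, -1):          # process columns right to left
        d[j] = P[j, j] - np.sum(d[j+1:] * U[j, j+1:] ** 2)
        for i in range(j - 1, -1, -1):
            U[i, j] = (P[i, j] - np.sum(d[j+1:] * U[i, j+1:] * U[j, j+1:])) / d[j]
    return U, d

# Factor an example covariance and confirm the reconstruction.
P = np.array([[4.0, 2.0, 0.6],
              [2.0, 3.0, 0.5],
              [0.6, 0.5, 1.0]])
U, d = udu(P)
assert np.allclose(U @ np.diag(d) @ U.T, P)
assert np.all(d > 0)   # positive definiteness is visible directly in d
```

Enforcing semi-positive definiteness then amounts to keeping the entries of d non-negative, as noted in the introduction.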

The UDU approach to propagating $U_k$ and $D_k$ forward in time makes use of the Modified Weighted Gram-Schmidt (MWGS) orthogonalization algorithm, which avoids loss of orthogonality due to round-off errors [21]. Measurements are processed one at a time as scalars by noting that, when $R_k$ is diagonal, the update in Eq. (2) is obtained by recursively processing one element of $y_k$ at a time, using the corresponding row of $H_k$ and diagonal element of $R_k$. The measurement residual covariance matrix $W_k = H_k\bar{P}_kH_k^T + R_k$ thus becomes a scalar, and the quantity $\bar{P}_kH_k^T = w_k$ becomes a vector; each scalar update then takes the form

$P_k = \bar{P}_k - \frac{1}{W_k}w_kw_k^T$ (8)

Since the matrix $\bar{P}_k$ is updated with a rank-one matrix ($\frac{1}{W_k}w_kw_k^T$), we call this a rank-one update. Agee and Turner [22] detailed how to directly update the $U_k$ and $D_k$ factors under a rank-one update. The subtraction in Eq. (8) can cause numerical instabilities in Agee-Turner's algorithm. Carlson [23] introduced an alternative rank-one update algorithm that, while less general, is more stable for the measurement update. Carlson's rank-one update is not valid for generic values of $w_k$ and $W_k$, but only when the update is performed with the optimal Kalman gain.
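The Agee-Turner rank-one update can be sketched as follows; this is our rendering of the recursion (variable names are ours), verified against a direct reconstruction:

```python
import numpy as np

def agee_turner(U, d, c, a):
    """Agee-Turner rank-one update: return factors of U diag(d) U^T + c a a^T.
    U is unit upper triangular, d holds the diagonal factor, c is a scalar."""
    U, d, a = U.copy(), d.copy(), a.copy()
    n = len(d)
    for j in range(n - 1, 0, -1):
        dj = d[j] + c * a[j] ** 2        # new diagonal entry
        b = c * a[j] / dj                # gain for the column update
        c = c * d[j] / dj                # deflated scalar for the next column
        d[j] = dj
        a[:j] -= a[j] * U[:j, j]         # remove the j-th component from a
        U[:j, j] += b * a[:j]
    d[0] += c * a[0] ** 2
    return U, d

# Check against a direct reconstruction of the updated matrix.
U0 = np.array([[1.0, 0.5, 0.2],
               [0.0, 1.0, 0.3],
               [0.0, 0.0, 1.0]])
d0 = np.array([2.0, 1.0, 3.0])
c0, a0 = 0.7, np.array([1.0, 2.0, -1.0])
U1, d1 = agee_turner(U0, d0, c0, a0)
target = U0 @ np.diag(d0) @ U0.T + c0 * np.outer(a0, a0)
assert np.allclose(U1 @ np.diag(d1) @ U1.T, target)
```

Note that the subtraction-induced instability mentioned above arises when c is negative (a covariance decrease), which is where Carlson's variant is preferable.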

An alternative formulation of the Kalman filter is the information formulation, where the covariance matrix P is replaced by its inverse. The covariance update and Kalman gain are calculated as [13]

$P_k^{-1} = \bar{P}_k^{-1} + H_k^TR_k^{-1}H_k$ (9)
$K_k = P_kH_k^TR_k^{-1}$ (10)

The information formulation is particularly useful when there is no prior information, i.e., $\bar{P}_0 = \infty$; in this case the covariance formulation of the KF is not defined, while the information formulation is, starting from $\bar{P}_0^{-1} = O$. In the covariance formulation, the $m \times m$ measurement residual covariance matrix $W_k = H_k\bar{P}_kH_k^T + R_k$ is inverted to process the measurement, while in the information formulation the $n \times n$ covariance matrix is inverted in the time propagation step. When $m > n$, therefore, the information formulation can be computationally cheaper, although measurements are often processed one at a time as scalars. Processing measurements as scalars is only possible when $R_k$ is diagonal; otherwise the additional step of a change of variables to diagonalize $R_k$ is required.
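The equivalence between the covariance update of Eq. (2) and the information update of Eq. (9) is easy to verify numerically; a sketch with randomly generated matrices (dimensions and seed are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 4, 2
A = rng.standard_normal((n, n)); P_bar = A @ A.T + n * np.eye(n)   # prior covariance
H = rng.standard_normal((m, n))
B = rng.standard_normal((m, m)); R = B @ B.T + m * np.eye(m)       # correlated noise

# Covariance form, Eq. (2): invert the m x m residual covariance W.
W = H @ P_bar @ H.T + R
P_cov = P_bar - P_bar @ H.T @ np.linalg.solve(W, H @ P_bar)

# Information form, Eq. (9): add H^T R^-1 H to the prior information matrix.
Y = np.linalg.inv(P_bar) + H.T @ np.linalg.solve(R, H)
P_inf = np.linalg.inv(Y)
assert np.allclose(P_cov, P_inf)

# Eq. (3): the gain computed either way agrees.
K1 = P_bar @ H.T @ np.linalg.inv(W)
K2 = P_inf @ H.T @ np.linalg.inv(R)
assert np.allclose(K1, K2)
```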

III. The UDU Information Filter

A. The Measurement Update

We begin by factorizing the covariance $P$ into an LDL form; that is, rather than using an upper triangular matrix we will use a lower triangular matrix. We denote the diagonal matrix by $\Delta$:

$P_k = L_k\Delta_kL_k^T \quad \text{and} \quad \bar{P}_k = \bar{L}_k\bar{\Delta}_k\bar{L}_k^T$ (11)

and we define $U$ and $D$ as the inverses of $L^T$ and $\Delta$, respectively

$P_k^{-1} = L_k^{-T}\Delta_k^{-1}L_k^{-1} = U_kD_kU_k^T$ (12)
$\bar{P}_k^{-1} = \bar{L}_k^{-T}\bar{\Delta}_k^{-1}\bar{L}_k^{-1} = \bar{U}_k\bar{D}_k\bar{U}_k^T$ (13)

so that the measurement update (Eq. (9)) becomes

$U_kD_kU_k^T = \bar{U}_k\bar{D}_k\bar{U}_k^T + H_k^TR_k^{-1}H_k$ (14)

We now factorize the m × m matrix Rk into LDL form as

$R_k = L_{R_k}\Delta_{R_k}L_{R_k}^T$ (15)

and

$R_k^{-1} = L_{R_k}^{-T}\Delta_{R_k}^{-1}L_{R_k}^{-1} = U_{R_k}D_{R_k}U_{R_k}^T$ (16)

so that Eq. (14) becomes

$U_kD_kU_k^T = \bar{U}_k\bar{D}_k\bar{U}_k^T + H_k^TU_{R_k}D_{R_k}U_{R_k}^TH_k$ (17)

We now work on the term $U_{R_k}^TH_k$, which is of dimension $m \times n$ and can be expressed as

$U_{R_k}^TH_k = \begin{bmatrix} v_1^T \\ v_2^T \\ \vdots \\ v_m^T \end{bmatrix}$ (18)

where each vi is an n × 1 vector.

The factor $H_k^TR_k^{-1}H_k$ can be expressed as

$H_k^TR_k^{-1}H_k = H_k^TU_{R_k}D_{R_k}U_{R_k}^TH_k = \begin{bmatrix} v_1^T \\ v_2^T \\ \vdots \\ v_m^T \end{bmatrix}^T \begin{bmatrix} \frac{1}{d_1^R} & 0 & \cdots & 0 \\ 0 & \frac{1}{d_2^R} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \frac{1}{d_m^R} \end{bmatrix} \begin{bmatrix} v_1^T \\ v_2^T \\ \vdots \\ v_m^T \end{bmatrix} = \sum_{i=1}^m \frac{1}{d_i^R}v_iv_i^T$ (19)

so that the measurement update equation is now

$U_kD_kU_k^T = \bar{U}_k\bar{D}_k\bar{U}_k^T + \sum_{i=1}^m \frac{1}{d_i^R}v_iv_i^T$ (20)

The measurement update thus reduces to a series of $m$ rank-one updates.

Notice that Eq. (20) is the update equation for a vector measurement, so the need to process scalar measurements, as in the covariance UDU formulation, is avoided in the proposed information UDU formulation. Rather than performing an eigenvalue decomposition of $R_k$ and a corresponding change of variables for $y_k$, we simply perform the UDU factorization of $R_k^{-1}$.
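The vector update of Eqs. (16)-(19) can be checked numerically. The sketch below factors $R_k^{-1}$ with a small UDU routine of our own and confirms that the rank-one terms sum to $H_k^TR_k^{-1}H_k$; here the array d_R stores the diagonal of $D_{R_k}$, i.e., the $1/d_i^R$ terms:

```python
import numpy as np

def udu(M):
    # UDU^T factors of a symmetric positive-definite matrix (unit upper U).
    n = M.shape[0]
    U, d = np.eye(n), np.zeros(n)
    for j in range(n - 1, -1, -1):
        d[j] = M[j, j] - np.sum(d[j+1:] * U[j, j+1:] ** 2)
        for i in range(j):
            U[i, j] = (M[i, j] - np.sum(d[j+1:] * U[i, j+1:] * U[j, j+1:])) / d[j]
    return U, d

rng = np.random.default_rng(2)
n, m = 4, 3
H = rng.standard_normal((m, n))
B = rng.standard_normal((m, m)); R = B @ B.T + m * np.eye(m)  # non-diagonal R

U_R, d_R = udu(np.linalg.inv(R))      # Eq. (16): factor R^-1, not R
V = U_R.T @ H                          # Eq. (18): rows of V are the v_i^T
S = sum(d_R[i] * np.outer(V[i], V[i]) for i in range(m))
assert np.allclose(S, H.T @ np.linalg.inv(R) @ H)   # Eq. (19)
```

No diagonalization of R or change of measurement variables is needed; the correlated noise is absorbed into the m rank-one terms.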

As stated earlier, one of the benefits of the information formulation is that an estimate can be obtained even when $\bar{P}_0^{-1}$ is singular [14]. When $\bar{P}_0^{-1}$ is singular, $\bar{x}_0$ is not completely defined. To this end, $\hat{z}_k$ and $\bar{z}_k$, which are directly related to $\hat{x}_k$ and $\bar{x}_k$, are defined as

$\hat{z}_k \triangleq P_k^{-1}\hat{x}_k, \qquad \bar{z}_k \triangleq \bar{P}_k^{-1}\bar{x}_k$ (21)

Premultiplying Eq. (1) by $P_k^{-1}$, we get

$\hat{z}_k = P_k^{-1}\left(I - K_kH_k\right)\bar{x}_k + P_k^{-1}K_ky_k$ (22)
$= \bar{z}_k + H_k^TR_k^{-1}y_k$ (23)

B. The Time Update

Prior to propagation, the standard information formulation of the Kalman filter inverts the information matrix to obtain the covariance, propagates the covariance with Eq. (6), and finally inverts the propagated covariance matrix to prepare for the measurement update phase. We propose an algorithm that propagates the factors of the information matrix directly. Starting from the covariance propagation of Eq. (6), we factorize $Q_k$ via a UDU parameterization so that

$Q_k = U_{Q_k}\Delta_{Q_k}U_{Q_k}^T$ (24)

where $\Delta_{Q_k}$ is a diagonal $p \times p$ matrix, and we define $G_{Q_k}$ as

$G_{Q_k} \triangleq G_kU_{Q_k}$ (25)

so that $\bar{P}_{k+1}$ becomes

$\bar{P}_{k+1} = \Phi_kP_k\Phi_k^T + G_{Q_k}\Delta_{Q_k}G_{Q_k}^T$ (26)

Invoking the matrix inversion lemma

$\left(Z + XAY\right)^{-1} = Z^{-1} - Z^{-1}X\left(A^{-1} + YZ^{-1}X\right)^{-1}YZ^{-1}$ (27)

and letting

$Z = \Phi_kP_k\Phi_k^T; \quad A = \Delta_{Q_k}; \quad X = G_{Q_k}; \quad Y = G_{Q_k}^T$ (28)

and

$Z^{-1} = M_k \triangleq \Phi_k^{-T}P_k^{-1}\Phi_k^{-1}$ (29)

The inverse of the propagated covariance is

$\bar{P}_{k+1}^{-1} = M_k - M_kG_{Q_k}\left[G_{Q_k}^TM_kG_{Q_k} + \Delta_{Q_k}^{-1}\right]^{-1}G_{Q_k}^TM_k$ (30)

Defining

$\bar{G}_k \triangleq \Phi_k^{-1}G_{Q_k}, \qquad D_{Q_k} = \Delta_{Q_k}^{-1}$ (31)

$\bar{P}_{k+1}^{-1}$ becomes

$\bar{P}_{k+1}^{-1} = \Phi_k^{-T}\left\{P_k^{-1} - P_k^{-1}\bar{G}_k\left[\bar{G}_k^TP_k^{-1}\bar{G}_k + D_{Q_k}\right]^{-1}\bar{G}_k^TP_k^{-1}\right\}\Phi_k^{-1}$ (32)
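The propagation identity of Eqs. (26)-(32) can be verified numerically; a sketch with arbitrary test matrices (dimensions and seed are ours):

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 4, 2
A = rng.standard_normal((n, n)); P = A @ A.T + n * np.eye(n)   # P_k
Phi = np.eye(n) + 0.1 * rng.standard_normal((n, n))            # Phi_k
G_Q = rng.standard_normal((n, p))                              # G_Qk
Dlt_Q = np.diag(rng.uniform(0.5, 2.0, p))                      # diagonal Delta_Qk

# Direct route: propagate the covariance, Eq. (26), then invert it.
P_bar = Phi @ P @ Phi.T + G_Q @ Dlt_Q @ G_Q.T
direct = np.linalg.inv(P_bar)

# Lemma route, Eq. (32): work only with P^-1 and the mapped matrix G_bar.
Pi = np.linalg.inv(P)
G_bar = np.linalg.solve(Phi, G_Q)                              # Eq. (31)
D_Q = np.linalg.inv(Dlt_Q)
mid = Pi - Pi @ G_bar @ np.linalg.inv(G_bar.T @ Pi @ G_bar + D_Q) @ G_bar.T @ Pi
Phi_inv = np.linalg.inv(Phi)
lemma = Phi_inv.T @ mid @ Phi_inv
assert np.allclose(direct, lemma)
```

In the factorized algorithm, of course, P is never inverted; the bracketed expression is carried directly as UDU factors.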

and defining Kk as

$K_k \triangleq P_k^{-1}\bar{G}_k\left[\bar{G}_k^TP_k^{-1}\bar{G}_k + D_{Q_k}\right]^{-1}$ (33)

Defining the quantity inside the braces in Eq. (32) as $\tilde{P}_k^{-1}$,

$\tilde{P}_k^{-1} \triangleq P_k^{-1} - P_k^{-1}\bar{G}_k\left[\bar{G}_k^TP_k^{-1}\bar{G}_k + D_{Q_k}\right]^{-1}\bar{G}_k^TP_k^{-1} = \left[I - K_k\bar{G}_k^T\right]P_k^{-1}$ (34)

We notice Eq. (34) is an analog of Eq. (2) under the correspondences

$P_k^{-1} \leftrightarrow \bar{P}_k, \qquad \tilde{P}_k^{-1} \leftrightarrow P_k, \qquad \bar{G}_k \leftrightarrow H_k^T, \qquad K_k \leftrightarrow K_k, \qquad D_{Q_k} \leftrightarrow R_k$

and since $D_{Q_k}$ is a diagonal $p \times p$ matrix, we can solve for the UDU factorization of $\tilde{P}_k^{-1}$ directly by applying the Carlson rank-one update [23] $p$ times to

$\tilde{U}_k\tilde{D}_k\tilde{U}_k^T = U_kD_kU_k^T - U_kD_kU_k^T\bar{G}_k\left[\bar{G}_k^TU_kD_kU_k^T\bar{G}_k + \Delta_{Q_k}^{-1}\right]^{-1}\bar{G}_k^TU_kD_kU_k^T$ (35)

so that we can find the time-propagated UDU factors of $\bar{P}_{k+1}^{-1}$ from

$\bar{P}_{k+1}^{-1} = \bar{U}_{k+1}\bar{D}_{k+1}\bar{U}_{k+1}^T = \Phi_k^{-T}\tilde{U}_k\tilde{D}_k\tilde{U}_k^T\Phi_k^{-1}$ (36)

Since this equation is equivalent to a covariance propagation without process noise, the MWGS orthogonalization algorithm can be used to obtain the factors $\bar{U}_{k+1}$ and $\bar{D}_{k+1}$.

Notice that $\Phi_k^{-1}$ does not necessarily need to be computed by a direct matrix inversion. Usually $\Phi_k$ is computed via integration of a matrix differential equation or by a series approximation. Similarly, $\Phi_k^{-1}$ can be obtained directly by backward integration or by a series approximation.
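For example, when the dynamics matrix is (locally) constant over a step, both $\Phi_k$ and $\Phi_k^{-1}$ follow from the same truncated series, evaluated with $+\Delta t$ and $-\Delta t$; a sketch with an illustrative dynamics matrix (not from the paper):

```python
import numpy as np

def phi_series(F, dt, order=10):
    # Truncated series for the state transition matrix exp(F*dt).
    n = F.shape[0]
    Phi, term = np.eye(n), np.eye(n)
    for k in range(1, order + 1):
        term = term @ (F * dt) / k
        Phi = Phi + term
    return Phi

F = np.array([[0.0, 1.0],
              [-2.0, -0.5]])   # illustrative dynamics matrix, not from the paper
dt = 0.1
Phi = phi_series(F, dt)
Phi_inv = phi_series(F, -dt)   # backward map: no matrix inversion performed
assert np.allclose(Phi @ Phi_inv, np.eye(2), atol=1e-9)
```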

Beginning with Eq. (5), the time update for $\bar{z}_{k+1}$ is obtained as follows

$\bar{P}_{k+1}^{-1}\bar{x}_{k+1} = \bar{P}_{k+1}^{-1}\Phi_kP_kP_k^{-1}\hat{x}_k$ (37)

which becomes

$\bar{z}_{k+1} = \bar{P}_{k+1}^{-1}\Phi_kP_k\hat{z}_k$ (38)

substituting from Eqs. (32) and (34)

$\bar{z}_{k+1} = \Phi_k^{-T}\left[I - K_k\bar{G}_k^T\right]\hat{z}_k$ (39)

or

$\bar{z}_{k+1} = \Phi^{-T}(t_{k+1}, t_k)\left[I - K_k\bar{G}_k^T\right]\hat{z}_k$ (40)

Table I summarizes the UDU information filter. The table uses a compact notation for the covariance time propagation and measurement update; in the actual algorithm the covariance factors $U_k$ and $D_k$ are individually propagated and updated using the rank-one update and the Modified Weighted Gram-Schmidt orthogonalization algorithms.

TABLE I.

Summary of UDU Information Filter Equations

Initialization
State: $\bar{z}_0 = P^{-1}(t_0)\bar{x}(t_0)$
Covariance: $\bar{U}_0\bar{D}_0\bar{U}_0^T = P^{-1}(t_0)$

Time Propagation
Truth: $x_{k+1} = \Phi_kx_k + G_k\nu_k, \quad \nu_k \sim N(0, Q_k)$
Process noise: $U_{Q_k}\Delta_{Q_k}U_{Q_k}^T = Q_k, \quad \bar{G}_k = \Phi_k^{-1}G_kU_{Q_k}$
Gain: $K_k = U_kD_kU_k^T\bar{G}_k\left[\bar{G}_k^TU_kD_kU_k^T\bar{G}_k + \Delta_{Q_k}^{-1}\right]^{-1}$
Covariance: $\tilde{U}_k\tilde{D}_k\tilde{U}_k^T = \left[I - K_k\bar{G}_k^T\right]U_kD_kU_k^T$ (rank-one update)
$\bar{U}_{k+1}\bar{D}_{k+1}\bar{U}_{k+1}^T = \Phi_k^{-T}\tilde{U}_k\tilde{D}_k\tilde{U}_k^T\Phi_k^{-1}$ (MWGS)
State: $\bar{z}_{k+1} = \Phi_k^{-T}\left[I - K_k\bar{G}_k^T\right]\hat{z}_k$

Measurement Update
Truth: $y_{k+1} = H_{k+1}x_{k+1} + \eta_{k+1}, \quad \eta_{k+1} \sim N(0, R_{k+1})$
Meas. noise: $U_{R_{k+1}}D_{R_{k+1}}U_{R_{k+1}}^T = R_{k+1}^{-1}, \quad \sum_{i=1}^m \frac{1}{d_i^R}v_iv_i^T = H_{k+1}^TU_{R_{k+1}}D_{R_{k+1}}U_{R_{k+1}}^TH_{k+1}$
Covariance: $U_{k+1}D_{k+1}U_{k+1}^T = \bar{U}_{k+1}\bar{D}_{k+1}\bar{U}_{k+1}^T + \sum_{i=1}^m \frac{1}{d_i^R}v_iv_i^T$ (rank-one update)
State: $\hat{z}_{k+1} = \bar{z}_{k+1} + H_{k+1}^TR_{k+1}^{-1}y_{k+1}$

C. An Efficient Algorithm to Compute $U^{-1}$

The proposed algorithm does not require inverting the covariance matrix or its $U$ or $L$ factor. However, if an initial covariance is provided, it may be convenient to factorize it first and to efficiently invert its factors rather than inverting the full covariance. In this section we compute the inverse in an efficient manner, taking advantage of the known ones and zeros in the factor. Given an $n \times n$ upper triangular "unit" matrix $U$ expressed as

$U = \begin{bmatrix} 1 & U_{1,2} & U_{1,3} & \cdots & U_{1,n-1} & U_{1,n} \\ 0 & 1 & U_{2,3} & \cdots & U_{2,n-1} & U_{2,n} \\ 0 & 0 & 1 & \cdots & U_{3,n-1} & U_{3,n} \\ \vdots & & & \ddots & & \vdots \\ 0 & 0 & 0 & \cdots & 1 & U_{n-1,n} \\ 0 & 0 & 0 & \cdots & 0 & 1 \end{bmatrix}$ (41)

the inverse is also an $n \times n$ upper triangular "unit" matrix $V$ (so that $\det(V) = 1$), which is

$V = U^{-1} = \begin{bmatrix} 1 & V_{1,2} & V_{1,3} & \cdots & V_{1,n-1} & V_{1,n} \\ 0 & 1 & V_{2,3} & \cdots & V_{2,n-1} & V_{2,n} \\ 0 & 0 & 1 & \cdots & V_{3,n-1} & V_{3,n} \\ \vdots & & & \ddots & & \vdots \\ 0 & 0 & 0 & \cdots & 1 & V_{n-1,n} \\ 0 & 0 & 0 & \cdots & 0 & 1 \end{bmatrix}$ (42)

since

$UV = I$ (43)

and, for $i < j$, the $ij$-th element of $UV$ is given by

$\sum_{k=i}^{j} U_{i,k}V_{k,j} = U_{i,i}V_{i,j} + U_{i,j}V_{j,j} + \sum_{k=i+1}^{j-1} U_{i,k}V_{k,j} = V_{i,j} + U_{i,j} + \sum_{k=i+1}^{j-1} U_{i,k}V_{k,j} = 0$ (44)

we can solve for the elements of V as

$V_{i,j} = -\left[U_{i,j} + \sum_{k=i+1}^{j-1} U_{i,k}V_{k,j}\right], \qquad j = n, \ldots, 2, \quad i = j-1, \ldots, 1$ (45)
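Eq. (45) translates directly into code; a minimal sketch (function name and test matrix are ours):

```python
import numpy as np

def unit_upper_inverse(U):
    """Invert a unit upper triangular matrix via Eq. (45); no divisions needed."""
    n = U.shape[0]
    V = np.eye(n)
    for j in range(n - 1, 0, -1):          # j = n, ..., 2 in 1-based indexing
        for i in range(j - 1, -1, -1):     # i = j-1, ..., 1
            V[i, j] = -(U[i, j] + np.dot(U[i, i+1:j], V[i+1:j, j]))
    return V

U = np.array([[1.0, 0.5, 0.2, 0.1],
              [0.0, 1.0, 0.3, 0.4],
              [0.0, 0.0, 1.0, 0.6],
              [0.0, 0.0, 0.0, 1.0]])
V = unit_upper_inverse(U)
assert np.allclose(U @ V, np.eye(4))
```

Each entry is obtained with a short dot product over previously computed entries of the same column, exploiting the unit diagonal so that no division is ever performed.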

IV. A Numerical Example

In this section we show the performance of the algorithm on a linear, time-varying example with a correlated measurement noise covariance matrix $R$. The system is given by

$x_{k+1} = \Phi_kx_k + \nu_k$ (46)
$y_k = H_kx_k + \eta_k$ (47)
$\Phi_k = \begin{bmatrix} I & A_k \\ B_k & I \end{bmatrix}$ (48)
$A_k = \begin{bmatrix} t_k - t_{k-1} & 0 \\ 0 & t_k - t_{k-1} \end{bmatrix}$ (49)
$B_k = 0.1\begin{bmatrix} \sin(t_k) - \sin(t_{k-1}) & -\left(\cos(t_k) - \cos(t_{k-1})\right) \\ 0 & \sin(t_k) - \sin(t_{k-1}) \end{bmatrix}$ (50)
$H_k = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}$ (51)

where $I$ is the identity matrix, $t_k - t_{k-1} = 1$ second, $\nu_k$ is a zero-mean Gaussian white sequence with covariance matrix $Q_k = 0.01\,I$, and $\eta_k$ is a zero-mean Gaussian white sequence with covariance matrix

$R_k = \begin{bmatrix} 2.96 & 2.8 \\ 2.8 & 2.96 \end{bmatrix}$ (52)

The initial estimate is unbiased, and the initial estimation error is Gaussian with covariance $P_0 = I$. Fig. 1 shows the result of a single run and the 3σ predicted standard deviations from a standard formulation of the Kalman filter. To show the equivalence between the Kalman filter (KF) and the UDU information approach (UDUI), Fig. 2 shows the norm of the difference between the two state estimates

$\epsilon_x = \left\|\hat{x}_{KF} - \hat{x}_{UDUI}\right\|$

while Fig. 3 compares the Kalman filter covariance $P_{KF}$ with the UDU factorization of the information matrix $P_{UDUI}^{-1} = UDU^T$ by plotting the quantity

$\epsilon_P = \left\|P_{KF}P_{UDUI}^{-1} - I\right\|$
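As a simplified check of this example (using plain matrix arithmetic in place of the UDU factors, so it verifies the information-form recursions of Eqs. (9), (23), and (38) rather than the factorized algorithm), one can run both filters on the system of Eqs. (46)-(52) and confirm the estimates agree; seed and run length are ours:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, dt = 4, 2, 1.0
H = np.array([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]])
Q = 0.01 * np.eye(n)
R = np.array([[2.96, 2.80], [2.80, 2.96]])   # correlated measurement noise
Ri = np.linalg.inv(R)

def Phi_k(tk):
    # Eqs. (48)-(50): block transition matrix for the time-varying example.
    tkm1 = tk - dt
    A = dt * np.eye(2)
    B = 0.1 * np.array([[np.sin(tk) - np.sin(tkm1), -(np.cos(tk) - np.cos(tkm1))],
                        [0.0, np.sin(tk) - np.sin(tkm1)]])
    return np.vstack((np.hstack((np.eye(2), A)), np.hstack((B, np.eye(2)))))

x = rng.standard_normal(n)                   # truth; initial error ~ N(0, I)
x_kf = np.zeros(n); P = np.eye(n)            # covariance-form filter
z = np.zeros(n); Y = np.eye(n)               # information form: Y = P^-1, z = Y x

for k in range(1, 50):
    Phi = Phi_k(k * dt)
    x = Phi @ x + rng.multivariate_normal(np.zeros(n), Q)    # truth
    y = H @ x + rng.multivariate_normal(np.zeros(m), R)      # measurement
    # Covariance-form KF, Eqs. (1)-(3) and (5)-(6).
    x_kf = Phi @ x_kf
    P = Phi @ P @ Phi.T + Q
    K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)
    x_kf = x_kf + K @ (y - H @ x_kf)
    P = (np.eye(n) - K @ H) @ P
    # Information form, Eqs. (38), (9), (23), with explicit inversions.
    P_prev = np.linalg.inv(Y)
    Y_bar = np.linalg.inv(Phi @ P_prev @ Phi.T + Q)
    z = Y_bar @ Phi @ P_prev @ z             # Eq. (38)
    Y = Y_bar + H.T @ Ri @ H                 # Eq. (9)
    z = z + H.T @ Ri @ y                     # Eq. (23)

assert np.allclose(x_kf, np.linalg.solve(Y, z))
```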

Fig. 1. Estimation error and 3σ predicted standard deviations.

Fig. 2. Norm of the state error, $\|\hat{x}_{KF} - \hat{x}_{UDUI}\|$.

Fig. 3. Norm of the covariance error, $\|P_{KF}P_{UDUI}^{-1} - I\|$.

The figures show that the proposed algorithm's results closely match the Kalman filter's, validating the proposed algorithm as a UDU information formulation of the Kalman filter. The growth of the error in Figs. 2 and 3 is due to the accumulation of round-off errors in the algorithms. It is known [3] that numerical errors accumulate faster in the full covariance formulation of the Kalman filter than in the UDU formulation.

V. Conclusions

A new algorithmic mechanization of the classic Kalman filter is presented; the new algorithm combines the information formulation with the UDU factorization. While the covariance formulation of the Kalman filter is usually employed, the information formulation has distinct advantages in some applications, for example when no initial condition is available. The UDU factorization is a widely adopted technique that produces a numerically stable and accurate algorithm and keeps the covariance matrix symmetric and positive semi-definite. A numerical example confirms the equivalence between the Kalman filter and the proposed algorithm.

Acknowledgments

This work was supported in part by NASA JSC contract NNX17AI35A.

Biography


Christopher D’Souza is a Navigation Engineer in the Aerosciences and Flight Mechanics Division at the Johnson Space Center, Houston, TX. He has developed navigation filters and guidance laws for the US Air Force and NASA. He has worked at the Jet Propulsion Laboratory, the Air Force Research Laboratory, Draper Laboratory and the NASA Johnson Space Center. His areas of research include onboard navigation algorithms, covariance analysis, guidance laws, and trajectory optimization. He received his Ph.D. in aerospace engineering from the University of Texas in 1991.


Renato Zanetti received his undergraduate degree from Politecnico di Milano, Italy in 2003 and his Ph.D. from the University of Texas at Austin in 2007, both in aerospace engineering. From 2007 to 2013 he was a senior member of the technical staff at the Charles Stark Draper Laboratory, in the Houston, Texas office. From 2013 to 2017 he was an Engineer at the NASA Johnson Space Center in Houston, TX. In 2017 he joined the faculty of the Aerospace Engineering and Engineering Mechanics department at the University of Texas at Austin. His projects include autonomous navigation for various NASA and commercial space vehicles. His research work focuses on nonlinear/non-Gaussian estimation, attitude dynamics and control, and autonomous vehicles.

Contributor Information

Christopher D’Souza, Aeroscience and Flight Mechanics Division, EG6, 2101 NASA Parkway, NASA Johnson Space Center, Houston, Texas 77058.

Renato Zanetti, Department of Aerospace Engineering and Engineering Mechanics, The University of Texas at Austin, Austin, Texas 78712.

References

[1] Thornton C, "Triangular Covariance Factorizations for Kalman Filtering," Ph.D. dissertation, University of California at Los Angeles, Oct. 1976.
[2] Bierman G and Thornton C, "Numerical comparison of Kalman filter algorithms: Orbit determination case study," Automatica, vol. 13, pp. 23–25, 1977.
[3] Bierman GJ, Factorization Methods for Discrete Sequential Estimation. New York: Dover Publications, 2006.
[4] Evans S, Taber W, Drain T, Smith J, Wu H-C, Guevara M, Sunseri R, and Evans J, "Monte: The next generation of mission design and navigation software," 6th International Conference on Astrodynamics Tools and Techniques (ICATT), Mar. 2016.
[5] Sud J, Gay R, Holt G, and Zanetti R, "Orion Exploration Flight Test 1 (EFT1) absolute navigation design," in Proceedings of the AAS Guidance and Control Conference, ser. Advances in the Astronautical Sciences, vol. 151, Breckenridge, CO, Jan. 31–Feb. 5, 2014, pp. 499–509, AAS 14-092.
[6] Kalman R, "A new approach to linear filtering and prediction problems," Trans. ASME (J. Basic Eng.), vol. 82D, pp. 34–45, Mar. 1960.
[7] Potter J and Stern R, "Statistical filtering of space navigation measurements," Proceedings of the AIAA Guidance and Control Conference, 1963.
[8] Battin R, An Introduction to the Mathematics and Methods of Astrodynamics. Reston, VA: AIAA, 1999.
[9] Golub GH and Van Loan CF, Matrix Computations, 2nd ed. Baltimore, MD: Johns Hopkins University Press, 1989, pp. 141–142.
[10] Verhaegen M and Van Dooren P, "Numerical aspects of different Kalman filter implementations," IEEE Transactions on Automatic Control, vol. 31, no. 10, pp. 907–917, Oct. 1986.
[11] Kaminski P, Bryson A, and Schmidt S, "Discrete square root filtering: A survey of current techniques," IEEE Transactions on Automatic Control, vol. 16, no. 6, pp. 727–736, Dec. 1971.
[12] Oshman Y and Bar-Itzhack IY, "Square root filtering via covariance and information eigenfactors," Automatica, vol. 22, no. 5, pp. 599–604, 1986.
[13] Brown RG and Hwang PY, Introduction to Random Signals and Applied Kalman Filtering, 3rd ed. John Wiley and Sons, 1997, pp. 246–250.
[14] Maybeck P, Stochastic Models, Estimation and Control, Vol. 1. New York, NY: Academic Press, 1979.
[15] Maybeck P, Stochastic Models, Estimation and Control, Vol. 2. New York, NY: Academic Press, 1979.
[16] Dyer P and McReynolds S, "Extension of square-root filtering to include process noise," Journal of Optimization Theory and Applications, vol. 3, no. 6, pp. 444–458, Dec. 1969.
[17] Golub G, "Numerical methods for solving linear least squares problems," Numerische Mathematik, vol. 7, no. 3, pp. 206–216, Jun. 1965, doi: 10.1007/BF01436075.
[18] Hanson RJ and Lawson CL, "Extensions and applications of the Householder algorithm for solving linear least squares problems," Mathematics of Computation, vol. 23, no. 108, pp. 787–812, 1969.
[19] Bierman G, "Sequential square root filtering and smoothing of discrete linear systems," Automatica, vol. 10, pp. 147–158, 1974.
[20] Bierman G, "The treatment of bias in the square root information filter/smoother," Journal of Optimization Theory and Applications, vol. 16, no. 1/2, pp. 165–178, Jul. 1975.
[21] Thornton C and Bierman G, "Gram-Schmidt algorithms for covariance propagation," IEEE Conference on Decision and Control, pp. 489–498, 1975.
[22] Agee W and Turner R, "Triangular decomposition of a positive definite matrix plus a symmetric dyad with application to Kalman filtering," White Sands Missile Range Tech. Rep. No. 38, 1972.
[23] Carlson N, "Fast triangular factorization of the square root filter," AIAA Journal, vol. 11, no. 9, pp. 1259–1265, Sep. 1973.
