Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2022 Jul 21;36(12):4337–4354. doi: 10.1007/s00477-022-02266-3

Multivariate Kalman filtering for spatio-temporal processes

Guillermo Ferreira 1,, Jorge Mateu 2, Emilio Porcu 3,4
PMCID: PMC9303052  PMID: 35892061

Abstract

An increasing interest in models for multivariate spatio-temporal processes has been noted in the last years. Some of these models are very flexible and can capture both marginal and cross spatial associations amongst the components of the multivariate process. In order to contribute to the statistical analysis of these models, this paper deals with the estimation and prediction of multivariate spatio-temporal processes by using multivariate state-space models. In this context, a multivariate spatio-temporal process is represented through the well-known Wold decomposition. Such an approach allows for an easy implementation of the Kalman filter to estimate linear temporal processes exhibiting both short and long range dependencies, together with a spatial correlation structure. We illustrate, through simulation experiments, that our method offers a good balance between statistical efficiency and computational complexity. Finally, we apply the method for the analysis of a bivariate dataset on average daily temperatures and maximum daily solar radiations from 21 meteorological stations located in a portion of south-central Chile.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00477-022-02266-3.

Keywords: Cross-covariance, Geostatistics, Kalman filter, State space system, Time-varying models

Introduction

The use of state-space models has been quite popular in past literature, and efficient algorithms such as Kalman Filter (KF henceforth) have been developed for the modeling of sophisticated time series models, (see e.g. Ferreira et al. 2013; Grassi and de Magistris 2014; Kim et al. 2014; Cheng et al. 2015; Wang and Chaib-draa 2016). Once the state-space representation is established, the KF algorithm can be used to estimate the state vector, as well as the model parameters, and build the multi-step-ahead predictor of the process and its mean square error matrix. The idea of developing techniques for the estimation of an unobserved state from the observed process goes back to the sixties. A friendly introduction to the general idea of the KF is offered in Kalman (1960), as well as in Maybeck (1979). More extensive references include (Hamilton 2020; Durbin and Koopman 2001; Grewal et al. 2001; Grewal 2011). It is well-known that the KF algorithm provides an efficient mean to evaluate the maximum likelihood estimates of Gaussian processes. This outperformance makes it a natural candidate to make statistical inferences for spatio-temporal processes. In this context, the KF has been implemented to perform inference and prediction for processes in the physical, ecological, environmental, and biological sciences, among others. Since the nineties, a wealth of papers have provided a variety of methodologies related to the KF; for illustrative purposes, just a few representative contributions are cited below.

Mardia et al. (1998) considered a mixed approach between the KF algorithm and Kriging methodology (named as kriged Kalman Filter), in which the state equation incorporates different forms of temporal dynamics to model space-time interactions. Huang and Cressie (1996) and Wikle (2003) developed empirical Bayesian space-time KF models for the analysis of snow water equivalent and monthly precipitation. Xu and Wikle (2007) proposed a spatio-temporal dynamic model formulation with restricted parameter matrices based on prior scientific knowledge, and developed a general expectation maximization (GEM) algorithm to carry out the estimations. Stroud et al. (2010) applied a dynamic state-space model to a sequence of SeaWiFS satellite images on the Lake Michigan. In this study the authors implemented a comprehensive version of the KF, called Ensemble Kalman filter, which allows to deal with problems of nonlinearities and high dimensionality inherent in satellite images. To deal with forecasting on spatio-temporal processes, Zes 2014 used the state-space system and a time-varying parameter least squares autoregressive system, with their respective solving algorithms, the KF, and autoregressive Adaptive Least Squares (ALS). Bocquet et al. (2015) discussed the methods available for data assimilation in atmospheric models, including Ensemble Kalman filter. More recently, Ferreira et al. 2017 have proposed a state-space methodology to model spatio-temporal processes where the temporal dependency is captured by short or long memory models, such as autoregressive moving-average (ARMA(pq) henceforth) or autoregressive fractionally integrated moving-average (ARFIMA(pdq) henceforth) through the infinite moving-average representation MA(). Finally, Lagos-Álvarez et al. 2019 have used the univariate KF to make predictions at unobserved locations by using AR-type autoregressive temporal dependence models.

In the multivariate context, there has been a notable interest in incorporating multiple variables in spatio-temporal modeling of real phenomena that are encountered in many branches of science. An important tool to model these kind of data comes in terms of multivariate spatio-temporal covariances that may involve nonstationarity and interactions between different variables, locations, and times. In this context, Genton and Kleiber (2015) reviewed the construction of valid cross-covariance models, which include the linear model of coregionalization, convolution methods, the multivariate Matérn, the multivariate Wendland Daley et al. (2015) and spatio-temporal extensions. Looking for appropiate methods of estimation has been a major problem that has led to a long discussion by several geostatistical researchers, for example (Bevilacqua et al. 2015, 2016; Bourotte et al. 2016; Alegría et al. 2018) among others. These methodologies have as a main goal spatio-temporal interpolation and prediction at locations with unobserved data that allow for useful inventory and monitoring purposes. In this spirit, there are authors that have used multivariate models to analyze datasets with spatio-temporal dependencies. This is the case of Daniels et al. (2006) who proposed a class of conditionally specified models for the analysis of multivariate spatio-temporal processes, where the dependence structure across processes and over space and time is completely specified through a neighborhood structure. Zheng and Zhu (2008) studied the ensemble filter algorithm as an alternative to calculate the background error covariances on building a balanced error model for an intermediate coupled model for El Niño-Southern Oscillation (ENSO) predictions. Their approach to build such a model is proposed on the basis of the multivariate empirical orthogonal functions method. Padilla-Buriticá et al. (2011) considered multivariate autoregressive models for the estimation of the brain activity from electroencephalographic (EEG) time series. They used Kalman filtering to estimate the source dynamics between the EEG and the neural activity into the brain which can be computed using Maxwell equations. Bradley et al. (2015) introduced a fully Bayesian methodology with multivariate spatio-temporal dependencies to estimate quarterly measures of average monthly income over various geographies of the US. These authors computed the KF and Kalman smoothing equations within each MCMC iteraction. In the last decade, we have seen a significant increase in new proposals to deal with regression multivariate spatio-temporal processes. Examples include, but are not limited to, those designed for non-normal multivariate data analysis called multivariate covariance generalized linear models (MCGLMs) (Bonat and Jørgensen (2016)), functional data in the context of multivariate space-time models (Li et al. 2022), geographically weighted multivariate multiple regression (GWMMR) Chen et al. 2021, machine learning techniques applied to some multivariate analysis like Generative Adversarial Networks (GANs) of multiple agents’ spatiotemporal data Bao et al. 2022 and those concerned with Bayesian approaches that allow predicting multivariate space-time processes (e.g., Vicente et al. 2020; Zeng et al. 2021; Christiansen et al. 2022). It cannot be left unmentioned that these types of models have been very useful to describe the behavior of the COVID-19 outbreak, see Mattera (2022), Briz-Redón et al. (2022), Kianfar et al. (2022), Paul et al. (2021) among others. Little effort has been made to extend the KF to the multivariate spatio-temporal regression setting.

However, to the best of our knowledge, there are no studies that analyze the finite sample behavior of the KF estimator, for both short and long memory multivariate spatio-temporal processes. We note that the extension of the univariate KF (as proposed in Ferreira et al. (2017) and/or Lagos-Álvarez et al. (2019)) to the multivariate context could be considered sort of a simple case from a mathematical point of view. However, there are several reasons that motivate the extension to the multivariate approach. First, in terms of inference, parameter estimation through optimization of the truncated likelihood is not straightforward from the univariate case. This procedure involves the cross-correlation amongst the components of the processes forming the multivariate observations which comes in the temporal covariance. Additionally, it makes some further burden estimating the spatial correlation structure within the multivariate state-space representation. Finally, another aspect that in the univariate case is not direct, is the way the temporal cross-covariance in the multivariate VARFIMA (vector autoregressive fractionally integrated moving-average) models is added into the KF algorithm. This cross-covariance is well-defined under the condition that the coefficients of the MA() representation are absolutely summable.

In this paper, we indeed propose a simple and flexible modeling strategy for multivariate spatio-temporal models through the well-known Wold decomposition, which under the assumption of absolutely summable coefficients provides an explicit formula for the temporal structure of the covariance. This allows to cover the family of multivariate linear processes such as VARMA (vector autoregressive moving-average) and VARFIMA to represent the temporal dependence, and to characterize the spatial association through cross-covariance functions of Matérn type. We use the KF algorithm for estimation of spatio-temporal processes as in Ferreira et al. (2017) and find that only a few terms of the truncated representation of MA() expansions are enough to capture the multivariate spatio-temporal dependencies in the process. Furthermore, by using this truncated representation in state-space models, we achieve great savings in both memory and computation of the likelihood. Finally, taking advantage of the benefits of KF, we propose a methodology to predict the process at unobserved locations.

The plan of the paper is the following. Section 2 discusses a class of multivariate spatio-temporal process and their representation in MA() expansion. Section 3 presents the state-space models and the algorithm for estimating the parameters involved in the spatio-temporal dependence structure. In Sect. 4 a simulation study reveals the good performance of the estimation method with the KF technique. Section 5 applies our proposal to model the space-time variability of the average daily temperature and maximun daily solar radiation. The main conclusions are summarized in Sect. 6. A supplementary material is also considered.

A class of spatio-temporal processes

We consider L-variate spatio-temporal Gaussian processes,

Yt(s)=Yt(1)(s),,Yt(L)(s):sR2,tR,

where s is a spatial location and t is a temporal instant. Here, is the transpose operator and Yt()(s) denotes the -th component of the process. For a set of available temporal instants t{1,,T}, we adopt a modeling strategy based on the following expansion

Yt(s)=Mt(s)β+εt(s),withεt(s)=j=0Ψjηt-j(s). 1

This representation is parenthetical to the well-known MA() decomposition for the errors with Ψj a sequence of L×L matrices with absolutely summable components, i.e, j=0|ψ(j)|<, where ψ(j) denotes the row , column element of the moving-average parameter matrix Ψj associated with lag j. The main advantage of this representation is that it provides an explicit formula to obtain the temporal covariance matrix, which is responsible for the characterization of the temporal dependence, given by

Γ(u;Σ)=k=0ΨkΣΨk+u,u=0,±1,±2,, 2

where Σ is an L×L matrix whose (,)-entry is given by ρσσ, and ,{1,,L}. Here σ2>0 is the marginal variance of the process ηt()(s), and ρ is the colocated correlation coefficient between processes ηt()(s) and ηt()(s), for ,{1,,L}, with ρ=1. Additionally, the vector MA() representation used in this paper satisfies the absolute summability condition, which ensures that the vector process is ergodic for the mean and for the second moment (see Hamilton (2020), page 263 for more details). In (1), Mts=diag(Mt,1(s),,Mt,L(s)), where each Mt,(s) is a 1×p vector of non-stochastic regressors, for =1,,L. Thus, Mts is an L×=1Lp matrix, β=(β1,,βL) is a =1Lp×1 vector of parameters, with each β being a p×1 vector. Finally, ηt(s)=ηt(1)(s),,ηt(L)(s) is an L-variate spatio-temporal stationary Gaussian process with zero mean and covariance function given by

Covηt()(s),ηt()(s)=ΣC(s,s)if|t-t|=00if|t-t|0,

for all t,tR, s,sR2. Throughout, we assume that C(s,s)=C(s,s;ψ) is a univariate parametric correlation model, which depends on a vector of unknown parameters ψ. The spatio-temporal covariance structure of the process defined in (1) is

CovYt(s),Yt(s)=k,l=0ΨkCovηt-k(s),ηt-l(s)Ψl=C(s,s;ψ)k=0ΨkΣΨk+t-t=C(s,s;ψ)Γ(t,t;Σ). 3

We conclude that CovYt(s),Yt(s) is a separable spatio-temporal covariance function Gneiting 2002, where Γ(u;Σ) is an L×L matrix that regulates the temporal dependency of the process given in (2). An alternative way of writing the functional form of the space-time covariance matrix of this process is the following. Define the LN×1 vector Yt=Yt(s1),,Yt(sN) containing the L-dimensional data values at N spatial locations, at each time instant t, then the spatio-temporal covariance structure (3) is equivalent to

CY=CΓ=[C(si,sj;ψ)]i,j=1N[Γ(t,t;Σ)]t,t=1T, 4

where C and Γ are the spatial and temporal covariances, respectively, and is the Kronecker product. In this paper, the spatial covariance function C(si,sj;ψ) is specified by an admissible covariance. In particular, we consider the general class of Matérn covariance models Matérn (1986) given by

C(si,sj;ψ)=12ν-1Γ(ν)(si-sjα)νKν(si-sjα), 5

for i,j=1,,N, where α>0, ν0, σ2>0 and Kν is the modified Bessel function of the second kind of order ν. A popular special case of the Matérn family is the exponential model C(s,s;ψ)=exp(-s-sα) which is obtained when ν=1/2. The temporal covariance matrix Γ(u;Σ) is specified through multivariate ARFIMA linear processes Box et al. (2015).

We note that in general KF methods work under the assumption of separability of the covariance structures in space and time, as in a non-separable case the autoregressive component would be lost; in particular the ARFIMA structure would have no sense (we refer to contributions (Ferreira et al. 2017; Padilla et al. 2020), among other authors, to argue about the separability assumption). However, the separability in space and time does not mean independence among the processes that form the multivariate process observation. These processes are clearly dependent due mainly to the temporal correlation structure. We come back to this at the end of this section.

For a neater and self-contained exposition, the following examples discuss some stationary multivariate spatio-temporal ARFIMA processes with their structure of temporal covariance.

Example 1

An example of the regression model (1) is the L-variate ARMA process of orders p and q that satisfies the equation

Φ(B)εt(s)=Θ(B)ηt(s),t=1,,T,

where Φ(z)=IL-Φ1z--Φpzp and Θ(z)=IL+Θ1z++Θqzq are matrix-valued polynomials having full rank, Ir denotes the r×r identity matrix, and B denotes the backward shift operator. Each component of the matrices Φ(z) and Θ(z) is a polynomial with real coefficients and degree less than or equal to p and q, respectively. A multivariate process of this nature is commonly described as a VARMA process (the initial letter denoting “vector”). The MA() representation of the VARMA process is εt(s)=j=0Ψjηt-j(s) provided that detΦ(z)0 for all zC such that |z|1. The matrices Ψj are obtained recursively from the equations

Ψj=Θj+k=0ΦkΨj-k,j=0,1, 6

where Θ0=IL, Θj=0, for j>q, and Φj=0, for j>p. The temporal covariance matrix Γ(u;Σ) can be expressed as

Γ(u;Σ)=j=0ΨjΣΨj+u,

where the matrices Ψj are found from (6). A particular case of this model is the autoregressive vector of order one, VAR (1), defined by

εt(s)=Φεt-1(s)+ηt(s),withΨj=Φj,j=1,2,,

where Φ=ϕ11ϕ12ϕ21ϕ22 and Σ=σ12σ1σ2ρσ1σ2ρσ22.

Example 2

A more general class of stationary processes are the VARFIMA processes, given by

Φ(B)diag(d)εt(s)=Θ(B)ηt(s), 7

where

diag(d)=d1000d2000dL.

Here, =1-B, and 0<d<1/2, for all =1,,L. Following Tsay (2010), the model defined in (7) can be represented as

diag(d)Φ(B)εt(s)=Θ(B)ηt(s).

These representations are identical if either Φ(B) is diagonal or the values of the differencing parameters remain the same across =1,,L. In the univariate case (L=1) the distinction between both models is irrelevant. The simplest case is the VARFIMA model with p=q=0, where we have the dynamic εt(s)=diag(-d)ηt(s), or equivalently, the -th component can be written as

εt()(s)=j=0ψj,ηt-j()(s),=1,,L, 8

where ψj,=Γ(j+d)Γ(j+1)Γ(d), and Γ(·) is the gamma function. The (,)-th element of the temporal covariance function for model (8) is

Γ(u;Σ)=j=0ψj,Σψj+u,=ΣΓ(1-d-d)Γ(d)Γ(1-d)Γ(u+d)Γ(u+1-d),

for every ,=1,,L. Other variants of the VARFIMA model can be found in Tsay (2010). For computational efficiency, it is well-known in the literature that Γ(u;Σ) can be calculated as follows

j=0ψj,Σψj+u,=ΣΓ(1-d-d)Γ(1-d)Γ(1-d)0kuk-1+dk-d.

In relation to the separability condition, some comments are in order. Although separable covariance functions can not take into account the interaction between the spatial and temporal components, such an interaction can be injected through the space-time trend Mt(s)β, see Genton (2007). On the other hand, separable structures allow for considerable computational gains as, for a given set of space-time locations and observations, the space-time covariance matrix factors into the product of a purely spatial with a purely temporal covariance matrix. Another advantage of the separable space-time covariance structure is the construction of a valid (positive definite) parametric model for the space-time correlation structure, which is not straightforward if we consider nonseparable space-time covariance models. For details, the reader is referred to Porcu et al. (2019) with the references therein.

State-space representation

In this section we discuss the state-space (SS) representation of the model defined in Eq. (1) with a space-time covariance structure given by (3) or (4). Once the SS system has been established, inferences are made about the state equation using all the information available until time t through the KF algorithm. Then if Yt(s) is an observation available at an instant of time t for some location s, the SS system for spatio-temporal processes is defined as follows

Yt(s)=Gt(s)Mt(s)Xt(s)βt+Wt(s),Xt+1(s)βt+1=Ft(s)00IpXt(s)βt+H0Vt(s), 9

where Gt(s) is the observation operator vector, Mt(s) is a vector of exogenous or predetermined variables, [Xt(s)βt] is a state vector, and Wt(s) is an observation noise with variance R. In addition, Ft(·) is a state transition operator, H is a linear operator, Vt(s) is spatially colored, temporally white and Gaussian with mean zero and a common covariance matrix Q, and Vt(s) and Wt(s) are uncorrelated, that is, EWt(s)Vt(s)=0, for all s and t. The process (1) can be represented by an SS system as above by generalizing the infinite-dimensional equations given by Ferreira et al. (2017) to the spatio-temporal multivariate case. This can be achieved by assignation of the (j+1)-th component of the state vector as the lag in j steps of the L-dimensional Gaussian process ηt(s), i.e. Xt(j+1)(s)=ηt-j(s), for j=0,1,. In this case, the process specified by (1) can be represented by the following infinite-dimensional state-space system

graphic file with name 477_2022_2266_Figa_HTML.gif

whereInline graphicI=diag{1,1,}, and Wt=0. The computational burdens for estimating the VARFIMA model are well-known in terms of dimensionality and span of data series (Tsay 2010; Morana 2007). We thus truncate the expansion in (1) after some positive integer m, so that

Yt(s)=Mt(s)β+ε~t(s), 10

where ε~t(s)=j=0mΨjηt-j(s). Thus, the SS representation of the model (10) is considered, with observation and state equations given by

graphic file with name 477_2022_2266_Equ11_HTML.gif 11

where Inline graphic

Let us denote by Mp×q the space of p×q matrices with real elements. Thus, we have that GML×L(m+1), Xt(s)ML(m+1)×1, FML(m+1)×L(m+1) and Vt+1(s)ML(m+1)×1. A study about the truncation level m will be presented later through a similarity study for spatio-temporal data following a truncated MA representation as in (10).

Derivation of the KF algortihm

The KF is a powerful tool to make inferences about the state vector which allows to calculate the conditional mean and covariance matrix of the state vector Xt(s),β(s). For simplicity, we assume that the trend function in (10) is identically equal to zero. Thus, (11) becomes the usual state equation. The KF recursion equations are well-known, but we present them here to introduce notation and for completeness. First, define the NL(m+1)-dimensional vector Xt=Xt(s1),,Xt(sN) as an unobservable spatio-temporal state process. In addition, we define the best linear unbiased predictor (BLUP) of the unobserved state Xt(s) and its error variance-covariance matrix as follows

X^t(s)=EXt(s)|Y1,,Yt-1,Ωt(s,s)=CovXt(s)-X^t(s),Xt(s)-X^t(s).

The Kalman recursive equations are defined as follows for the initial conditions X^1(s)=Eη1-j(s)=0, for j=0,1,,m, and

Ω1(s,s)=EX1(s)X1(s)=C(s,s;ψ)Γ(0;Σ)0L×L0L×LC(s,s;ψ)Γ(0;Σ)=I(m+1)C(s,s;ψ)Γ(0;Σ)(L×L).

The KF allows to estimate the state vector Xt+1(s) and its prediction variance based on the information available at time t. These estimators are given by

X^t+1(s)=FX^t(s)+Θt(s)Δt-1(Yt-Y^t),Ωt+1(s,s)=FΩt(s,s)F+Qtη(s,s)-Θt(s)Δt-1Θt(s), 12

where

Δt=VarYt-Y^t|Yt-1=VarG(Xt-X^t)|Yt-1=GΩt(s1,s1)GGΩt(s1,sN)GGΩt(s1,sN)GGΩt(sN,sN)G,Θt(s)=Cov(Xt+1(s),Yt-Y^t)=Cov(Xt+1(s),Yt(s1)-Y^t(s1))Cov(Xt+1(s),Yt(sN)-Y^t(sN))=FΩt(s,s1)GFΩt(s,sN)G,Y^t=E(Yt|Y1,,Yt-1)=GXt(s1)GXt(sN),Qtη(s,s)=CovVt+1(s),Vt+1(s)=T(m+1)C(s,s;ψ)Γ(0;Σ),

where T(m+1) has dimension (m+1)×(m+1) and is given by T(m+1)=100000000. Let θ be the vector that contains all the parameters involved in the model, then the log-likelihood function (omitting a constant) can be obtained from (12),

L(θ)=-12t=1Tlog|Δt(θ)|+ϵt(θ)Δt(θ)-1ϵt(θ),

where ϵt(θ)=Yt-Y^t is the innovation vector and Δt(θ) is the innovation covariance matrix at time t. Hence the exact maximum likelihood estimate (MLE) provided by the Kalman equations (12) is given by θ^=argmaxθL(θ). Note that the Kalman Eq. (12) can be applied directly to the general state-space representation (9) or to the truncated representation (11), yielding in the latter case an approximated MLE.

It is worth noting that for time series models with a long memory structure, Chan and Palma (1998) proved that the exact likelihood function can be computed recursively in a finite number of steps. On the other hand, an approximation to the likelihood function based on the truncated SS equation is also possible obtaining a considerable reduction in the number of iterations of the algorithm. Indeed, the order of computation of the algorithm is m2×T instead of T3. In the context of multivariate spatio-temporal processes the number of iterations increases considerably, for example, if N is the number of spatial locations, we also need to solve matrices of order LN×LN for each iteration, i.e. the algorithm has an order of computation of (LN)3×m2×T. Here, and in order to reduce computational time, we have implemented the KF algorithm in C source code connected to R free software R Core Team (2015) through the interface called .CChaudhary (2007); in particular, the QR decomposition method for the inverse of matrices was implemented into pure C. On the other hand, numerical optimization of the Gaussian log-likelihood function to obtain the QML estimates was carried out using the optim command of R. This method makes use of the subroutine “L-BFGS-B” corresponding to a quasi-Newton method (Zhu et al. 1997; Dai 2002). A numerical analysis about the computational time to estimates a VAR(1) model is available in the supplementary material, Section A.1 (online).

Spatial interpolation through the KF

Following the proposal of Lagos-Álvarez et al. (2019) for the univariate case, we propose spatial interpolation in a multivariate context by means of the KF algorithm, at locations where we do not have information. Denote by s0 a location where there is no record of Yt(·). The equations that allow us to achieve predictions for Yt(s0) given the information Yt, for t=1,,T, are given by

X^t+1(s0)=FX^t(s0)+Θt(s0)Δt-1(Yt-Y^t),Ωt+1(s0)=FΩt(s0)F+Qtη(s0)-Θt(s0)Δt-1Θt(s),

where

Θt(s0)=CovXt+1(s0),Yt|Yt-1=Cov(Xt+1(s0),Yt(s1))Cov(Xt+1(s0),Yt(sN))=FΩt(s0,s1)GFΩt(s0,sN)G,Ωt(s0)=CovXt(s0),Xt|Xt-1,Qtη(s0)=CovVt+1(s0),Vt+1,andΘt(s)=CovXt+1(s),Yt|Yt-1.

Then, the updated equation through the KF leads

Ωt+1(s0,s0)=FΩt(s0,s0)F+Qtη(s0,s0)-Θt(s0)Δt-1Θt(s0),

and thus the prediction variance at location s0 is given by

Δt(s0)=Var(Yt(s0)-Y^t(s0))=GΩt(s0,s0)G.

Note that according to the state vector update in (12), it requires that all observations in time are available. Then, if the data have missing values, they must be previously imputed, a methodology in this line is proposed by Padilla et al. (2020) for the space-time autoregressive univariate model.

Simulation studies

In this section, several Monte Carlo experiments are carried out to analyze the finite sample behavior of the KF estimator for short and long memory spatio-temporal processes. We work under the bivariate (L=2) and trivariate (L=3) cases. We consider spatial sites being uniformly distributed on the square [0,1]2, with N×T=25×200 or N×T=100×300. Thus, when L=2, we have 10×103 and 60×103 observations, respectively. We consider a correlation structure C(s,s;α) of Matérn type given in (5), which is the most popular correlation structure used throughout the spatial statistics literature. In particular, we use two special cases of the Matérn model:

  • Model 1. Matérn with smoothness parameter ν=12, corresponding to the exponential model
    C(h;α)=exp-hαwithh=s-s. 13
  • Model 2. Matérn with smoothness parameter ν=32, which leads to
    C(h;α)=1+hαexp-hα.

Here, α>0 is a scale parameter that controls the rate of the spatial dependence decay; a large value of α corresponds to a high spatial correlation. For the temporal dependence, we consider two models, a VAR(1) in a bivariate short memory case, and a VARFIMA(0, d, 1) in a trivariate long memory case. The source codes used in the paper in R and C are available upon request to the corresponding author.

Short memory case

Consider the VAR(1) model with the errors defined by (1), where

εt(1)(s)εt(2)(s)=ϕ11ϕ12ϕ21ϕ22εt-1(1)(s)εt-1(2)(s)+ηt(1)(s)ηt(2)(s). 14

The sequence of matrices in (1) satisfies Ψj=Φj=ϕ11ϕ12ϕ21ϕ22j. The unknown parameters are Φ and ϑ, where ϑ=(σ12,σ22,ρ12,α). We study the following scenarios:

  • Case 1. θ=(Φ,ϑ)=(0.45,0.2,0.15,0.65,1,1,0.5,0.5)

  • Case 2. θ=(Φ,ϑ)=(0.15,0,0,0.35,1,0.5,-0.3,1.5).

We assume that we observe a spatio-temporal process εt(s) at t=1,,T and s=(s1,,sN), on a regular rectangular grid of N spatial locations in [0,1]2 and at equidistant time points. For the data generation scheme, the process is generated recursively from (14) with initial values ε1()(s)N(0,σ2) for =1,2. In order to give a neater picture about the finite sample performance of the KF algorithm, Monte Carlo evaluations of 100 independent realizations are performed and the results from these 100 independent realizations are hence represented by boxplots of the absolute error and displayed in Fig. 1 Panel (a). For the case of Model 2 and parameter vector as in Case 2 schemes, we also report boxplots of the absolute error, see Fig. 1 Panel (b). Inspecting both figures, we conclude that the use of the KF estimator for multivariate short memory spatio-temporal processes is sufficiently accurate. Moreover, it is also shown that for all the schemes, the Inter-quartile range (IQR) of the estimated parameters is in most cases smaller than 0.05, i.e., two decimal digits of precision. This demonstrates that the error of the KF estimator methods is really small, thus the method is robust and works extremely well for both sample sizes.

Fig. 1.

Fig. 1

Boxplots for 100 realizations of the absolute error in terms of the estimations from a 2-dimensional VAR(1) model: Panel a: the case N=25. Panel b: the case N=100. Panel c: 3-dimensional VARFIMA(0, d, 1) model with N=25

Long memory case

This subsection considers the impact of VARFIMA(0, d, 1) parameters on the performance of the KF algorithm. We focus on the following 3-dimensional model

d1000d2000d3εt(1)(s)εt(2)(s)εt(3)(s)=1+θ11B0001+θ22B0001+θ33Bηt(1)(s)ηt(2)(s)ηt(3)(s),orεt()(s)=k=0ψk,ηt-k()(s),=1,2,3, 15

with ψk,=Γ(k+d)Γ(k+1)Γ(d)+θΓ(k+d-1)Γ(k)Γ(d), where d is the long memory coefficient such that 0<d<1/2 and θ is the moving-average coefficient satisfying |θ|<1. Concerning the innovations {ηt()(s)} in MA() representation (15), these were generated by using the innovation algorithm (see Hamilton (2020)). In this implementation, the temporal covariance of the process {εt()(s)} is given by Γ(u;Σ)=[ΣΓ(u)],=13, where

Γ(u)=Γ(1-d-d)Γ(u+d)Γ(1-d)Γ(d)Γ(u+1-d)×1+θθ+θu-du-1+d+θu+du+1-d,

and Σ=[ρσσ],=13. The parameters considered are the following: (d1,d2,d3)=(0.4,0.3,0.2), and (θ11,θ22,θ33)=(-0.7,-0.5,-0.3). To capture the spatial dependence, we consider a trivariate covariance exponential model, defined in Eq. (13). The parameters of Model 1 together with the variance and correlation parameters are given by ϑ=(1,1,1,0.5,0.5). Note that we have assumed ρ=0.5 for ,{1,2,3} in this simulation scenario. From Fig. 1 Panel (c) we see that the IQR of parameter vector θ is much broader compared to the rest of the parameters. The rest of the parameters have relatively low IQR, in all the cases below 0.05. This confirms the earlier result about the the robustness of the KF method for parameter estimation in multivariate long-memory processes. For a detailed formulation of both studies, we refer the readers to supplementary material in Section A.2.

Truncation level m

An interesting question that arises from our proposal has to do with the optimal truncation level to obtain estimates close to the true value of the parameter. In this section, a study is carried out consisting of data coming from Model 1 for both a VAR(1) and a VARFIMA trivariate model defined in (14) and (15), respectively, and where we consider different values of the truncation level m. In each case, we have used N=25 and 250 observations over time. Figure 2 shows the absolute error in terms of the truncation level for the estimates of VAR(1) and a VARFIMA trivariate model. From both Figures we underline that for m=5 the estimates are close to the horizontal line, and this is improved as this value increases. For m=30,40 the estimates stabilize. This suggests that the estimates obtained by the approximated MLE require a small number of m.

Fig. 2.

Fig. 2

Absolute errors of the parameter estimates as a function of m for the MA approximation. Panels a, b: Estimation of a VAR(1) model defined in (14) with Model 1 and Case 1. Panels c, d: Estimation of a VARFIMA trivariate model defined in (15) with covariance function following an exponential model

Real data illustration

This section analyzes part of the integrated Agromet network which contains more than 100 meteorological stations throughout Chile, including daily temperature data, soil temperature, rainfall, humidity, solar radiation, wind speed and direction, among others. This dataset is reported and updated by the Institute of Agricultural Research (INIA) and can be available from the website http://www.agromet.cl.

We focus our interest in three regions, namely Maule, Biobío and Araucanía which represent a portion of south-central Chile. This area is surrounded by mountains (mountain of the coast to the east and mountain of the Andes to the west). According to the 2007 agriculture national census, this area has a surface of 99.206 km2. The Maule Region concentrates 17.2% of the crop national area, its main use corresponds to forest plantations, followed by cereals, fruit trees, forage plants and vineyards and parronals, groups that together respond to 94% of the area of crops in the region. The BíoBío region accounts for 28.1% of the crop national area. The main use, with 79.0% of the total, is for forest plantations, with cereals and forage plants, but with a smaller participation. Finally, Araucanía region covers 20.6% of the crop national area, its main use corresponding to forest plantations with 64.3% of that total, followed by grains with 18.5% and forage plants with 9.8%. These three uses account for 92.6% of the region soil. As can be noted, the study area is mostly devoted to agriculture, and therefore both air temperature and solar radiation are predominant factors in crop growth; note that extreme values of temperature and radiation affect the production and quality of agricultural and fruit products. The frost causes a deterioration of production, reducing the activity of the agricultural industry in the south-central zones of our country, generating losses of thousands of dollars, besides to the paralyzation and the low activity in the exports to the external market. In this way, it is of great interest to study the spatio-temporal variability of these meteorological processes, and therefore to generate proposals that help to explain such variability in a coherent and appropriate way.

In particular, we study the behavior of the space-time variability of the average daily temperature and maximum daily solar radiation, information obtained from the 21 meteorological stations located between Maule, Biobío and Araucanía Regions. Figure 3 displays the spatial distribution of these meteorological stations.

Fig. 3.

Fig. 3

Locations of the selected 21 meteorological stations in the Maule, Biobío and Araucanía regions, Chile

In section A.3 of the supplementary material we can find some exploratory analysis. In particular, Panel (a) of Fig. A3.4 (online) shows a plot of the average daily temperature time series observed during the year 2016, and panel (b) displays the maximum daily solar radiation time series. Also, Table A3.1 (online) summarizes some descriptive statistics of the average daily temperature Yt(1) and the maximum daily radiation Yt(2) for some selected meteorological stations.

The model

We consider the following bivariate spatio-temporal model for the Agromet data

Yt(1)(s)Yt(2)(s)=Mt,100Mt,2β1β2+εt(1)(s)εt(2)(s). 16

The time series shown in Fig. A2.2 (online) seems to present a seasonal component. In particular, a joint analysis of the periodogram of the detrended data and the autocorrelation function (ACF) revealed the presence of plausible seasonal frequencies at ω1=2π366.25 and ω2=2π183.25. Thus, the following non-stochastic regressors are considered

Mt,1=Mt,2=β0+j=12βjsin(tωj)+αjcos(tωj)+β3h(s)+β4lat,

where h(s) is the elevation and the covariate lat stands for the latitude; note that both affect the temperature and radiation. In order to obtain information on the temporal correlation structure of the data, we study the residuals eT=Yt(s)-Mt(s)β^ for t=1,,T. The marginal sample ACF and cross-ACF of the residuals are shown in Fig. A3.5 of supplementary material. Both plots suggest that the time series exhibit short-range dependence. In particular, the disturbances {εt(2)(s)} in the linear regression model given in (16) have a VAR(1) structure, i.e.,

εt(1)(s)εt(2)(s)=ϕ11ϕ12ϕ21ϕ22εt-1(1)(s)εt-1(2)(s)+ηt(1)(s)ηt(2)(s),

where {ηt(·)} are independent over time and follow a stationary Gaussian spatial process with mean zero and spatial covariance structure defined in (13) with =2. Table 1 reports the parameter estimates using the KF with truncation level m=5. We can observe that the estimated autoregressive coefficients are high, ϕ11=0.5845 and ϕ22=0.5033, which suggest that the temperature and solar radiation data are highly correlated to the temperature of the previous day. On the other hand, the temporal dependence of Yt(1) on Yt(2) is strong with ϕ12=0.2959, the relationship between Yt(2) and Yt(1) is weak with ϕ21=0.0053. Additionally, the estimated range parameter is high, α=1.4378, in relation to the spatial sampling scheme, with a maximum distance between the stations of h=381.8 kms. This informs that the spatial process has a high spatial continuity. Finally, the correlation coefficient between Yt(1)(si) and Yt(1)(sj) for i,j=1,,21 is positive with value ρ=0.1011.

Table 1.

Parameter estimates for average daily temperature and daily solar radiation data

Mt(s) β0 Parameter β4
β1 α1 β2 α2 β3
35.9932 2.3869 −0.0372 −0.0034 0.1793 0.2545 −0.0085
εt(s) ϕ11 ϕ12 ϕ21 ϕ22 σ12 σ22 ρ α
0.5845 0.2959 0.0053 0.5033 1.0681 0.9994 0.1011 1.4378

Prediction

The KF algorithm has proven to be a powerful tool for interpolation, see Lagos-Álvarez et al. (2019). Taking advantage of the benefits of this algorithm, we use the procedure shown in Sect. 3.2 to predict the average temperature and radiation variables at unobserved locations for the three regions under study, that is, Maule, Biobío and Araucanía regions. For this purpose we considered a grid of 5955 nodes within the study area. In each of the nodes, we made predictions for the two variables, therefore we interpolated 11910 records. Additionally, the time domain considered was from January 01 to December 31, 2016, that is 365 days, with a total of 4347150 observations. Due to space constraints, only a subset of the results are presented, in particular we report the predictions for the fortnights of each month. Figure 4 depicts the predictions every fifteen days for all the months of the year. From Panel (a), and omitting the interpolations associated with the Andes mountains, a fairly marked general seasonality can be observed, lower temperatures are associated with the winter months, while the higher ones to the summer season. It should be noted that in June and July, the coast presents lowest temperature values than in the intermediate depression. This is more evident on the border of the Biobio and Araucanía regions, in particular the Cañete area (ID 10), area where the Nahuelbuta mountain range predominates. Regarding the prediction of radiation (Fig. 4(b)), we can see that the intermediate zone has higher radiation in the summer season, compared to the winter season. Once again, the area of Cañete to the south along the coast has lower levels of radiation in all seasons of the year. Additionally, Fig. 5 shows prediction variance maps. In relation to the prediction variance, we note that in the observation locations a smaller prediction variance is reflected.

Fig. 4.

Fig. 4

Prediction of space-time environmental data of the fortnights of each month of the year 2016: Panel a: Temperature space-time predictions. Panel b: Radiation space-time predictions

Fig. 5.

Fig. 5

Prediction variance of space-time environmental data of the fortnights of each month of the year 2016: Panel a: Temperature prediction variance. Panel b: Radiation prediction variance

Performance of the prediction procedure

The aim of this section is to compare the prediction performance through a cross-validation study to see the performance of the model represented by Eq. (16) (denoted here by Model 1) with models that have the same temporal dynamics and that have used the KF as a prediction tool. This will allow us to justify the use of our method compared to other standard KF implementations. For this, two additional models will be defined, which are described below:

  • Model 2: A VAR(1) model as that in (16) is considered. However, the setting of both the SS representation and the KF differs from the proposal of this paper, that is, here we will not use the truncated MA representation as in (10). Here we use the classical development of the KF (see Hamilton (2020)) with Gt(s)=I2, Xt(s)=Yt(s), Ft(s)=Φ, and Vt(s)=[ηt(1)(s),ηt(2)(s)] in (9).

  • Model 3: A more general version of model (16) is considered by using location-dependent parameters that describe spatial non-stationarity in the VAR(1) model. In this case, the sequence of matrices in (10) satisfies Ψj(s)=Φj(s)=ϕ11(s)ϕ12(s)ϕ21(s)ϕ22(s)j, with s=(s1,,s21) and j=1,,m.

The estimates of Φ(s) for Model 3 are carried out using non-parametric techniques based on cubic splines. In Fig. 6 the continuous line represents the estimates (through cubic splines) of the spatial movement of the parameters of the VAR(1) model. The dots represent the heuristic approach (i.e., for each location, the parameters determining a stationary VAR(1) process are estimated), and the horizontal dashed line indicates the estimates of a stationary VAR(1) model with KF.

Fig. 6.

Fig. 6

Agromet data: Top-Left, estimates of ϕ11(s). Top-Right, estimates of ϕ12(s). Bottom-Left, estimates of ϕ21(s). Bottom-Right, estimates of ϕ22(s). In all panels the continuous line represents the curves estimated through cubic splines, the dots represent the heuristic approach, and the horizontal dashed line indicates the estimates of stationary VAR(1) model with KF

To compare the prediction performance of Models 1-3, a cross-validation method was proposed. Towards this end, we have removed the data {Yt(si):t=1,,366} for each i=1,,21, and predicted Yt(si) from the remaining data. The predicted value of Yt(si) is denoted by Yt(j)(si), where j=1,2,3 represents Models 1-3, respectively. Finally, the criterion used to quantify the predictive power was the mean squared error of prediction (MSEP) given by

MSEP(Yt(·)(si))=1366t=1366[Yt(si)-Yt(·)(si)]2,

and the gain, to quantify how better Model 1 is, is given by

Gainj=(MSEP(Yt(j)(si))MSEP(Yt(1)(si))-1)×100%forj=2,3.

Table 2 shows the gain results for the MSEP through a cross-validation method using the KF algorithm for Models 1–3. The gain of Model 1 with respect to Model 2 is similar in both variables, on average the profit is 0.5 and 0.3% respectively. This small gain is expected since they are similar performances in relation to the stationary temporal dynamics of the VAR(1) model. While the gain of Model 1 with respect to Model 3 is significant only in variable Yt(2)(si), with an average gain of 3.6%. However, the first variable shows a slight gain in favor of Model 3. From this information we can say that our proposal provides a potentially powerful prediction tool compared to Models 2-3.

Table 2.

Gain obtained through a cross-validation method using models 1–3

Station ID Gain2% Gain3%
Yt(1)(si) Yt(2)(si) Yt(1)(si) Yt(2)(si)
Coronel del Maule 1 0.2149 −0.5302 −0.7299 1.9439
Los despachos 2 1.5979 −0.7680 −0.8654 2.8899
Chanco 3 0.5868 0.4406 −0.1610 3.4481
Santa Sofía 4 −0.9589 1.8506 −0.6698 2.1915
San Clemente 5 0.8052 0.7750 −0.1333 0.8885
Coronel 6 1.0312 7.5340 −0.2641 4.5875
Chiguayante 7 1.6519 2.0511 −0.4154 4.9166
Human 8 1.9147 2.6118 −0.3967 3.6875
Cañete 9 −1.0350 −3.6344 −0.1074 3.3202
Nueva Aldea 10 0.4047 −0.0339 −0.3177 1.4633
Ninhue 11 2.7436 −2.3497 −0.5329 1.4357
Navidad 12 −0.7507 1.1819 −0.5580 3.6217
Punta Parra 13 −1.4067 0.8627 −0.2837 5.6725
Sta Rosa 14 1.5834 1.4270 −0.6897 3.5880
Dominguez 15 0.2926 −1.8261 −0.1475 6.4187
C. Llollinco 16 1.7383 −2.2445 −0.0523 4.2171
Cuarta Faja 17 1.6171 −4.5170 −0.0487 2.8786
Quiripio 18 −1.5453 1.0951 −0.1922 8.7234
San Luis 19 0.4883 0.4539 −0.1199 2.1234
Tranapuente 20 1.1039 1.3299 −0.3594 5.4774
Sta. Adela 21 −1.8847 0.2858 −0.2652 3.1700
Average 0.4854 0.3360 −0.3481 3.6506

Conclusions

We have presented a state-space methodology to model multivariate spatio-temporal processes. In particular, we have proposed to model the temporal dependence structure through the infinite moving-average representation MA(). In this context, we have incorporated the ARFIMA models to quantify the temporal correlation and valid Matérn cross-covariance models to characterize the spatial correlation in the spatio-temporal processes. In terms of the estimation procedure, we have proposed an approximation to the likelihood functions via truncation which provides an efficient means to calculate the MLE. Simulation studies evidenced that the proposed approach can be extremely efficient for small truncation levels. Furthermore, this approach allows to overcome the computational burdens while reducing substantially the size of the required memory whenever we deal with large spatio-temporal datasets.

In general, we can mention that one of the major advantages of using multivariate KF, is the use of a single model that explains all the phenomena in question, a remarkable quality in practice, aiming at explaining the phenomenon simply using most of the available information. In contrast to the more classical geostatistical techniques that only incorporate spatial information, where we have to propose a different model associated with a given instant of time.

Our methods can be easily extended to the estimation of multivariate space-time covariance models when considering fully symmetric covariance models. For estimation of asymmetric covariance models (Stein 2005), asymmetry in time should be taken into account in the state-space system following the lines of Sect. 3. Finally, note that a typical problem, shared in a number of practical applications, is that many multivariate spatio-temporal datasets are affected by missing data. The SS methodology allows to directly tackle this issue only in the univariate time series case. Padilla et al. (2020) developed space- time estimation and prediction methods in presence of missing data by using an EM algorithm. Thus, that is a promising topic for future research, which will allow to face missing values in multivariate spatio-temporal data. Finally, we could not find any R packages for fitting joint ARFIMA o ARMA space-time models in the multivariate case by using KF algoritm. We plan to develop a full multivariate KF package that takes full advantage of the results obtained in this study. At the same time, it would be extremely interesting to write an additional work with a targeted comparison between the existing methods. We take it as a challenge for further research.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

Guillermo Ferreira would like to express his thanks for the support from ENLACE 2018.014.028-1, established by the Universidad de Concepción and Center for the Discovery of Structures in Complex Data (MiDas). Jorge Mateu has been partially funded by Grant No. MTM2016-78917-R from the Ministerio de Economía y Competitividad and by Grant No. UJI-B2018-04 from University Jaume I. Finally, the authors would like to thank Dr. Alfredo Alegría for their advice and comments.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Alegría A, Porcu E, Furrer R. Asymmetric matrix-valued covariances for multivariate random fields on spheres. J Stat Comput Simulat. 2018;88(10):1850–1862. doi: 10.1080/00949655.2017.1406488. [DOI] [Google Scholar]
  2. Bao P, Chen Z, Wang J, Dai D. Multiple agents’ spatiotemporal data generation based on recurrent regression dual discriminator gan. Neurocomputing. 2022;468:370–383. doi: 10.1016/j.neucom.2021.10.048. [DOI] [Google Scholar]
  3. Bevilacqua M, Hering AS, Porcu E. On the flexibility of multivariate covariance models: comment on the paper by genton and kleiber. Stat Sci. 2015;30(2):167–169. doi: 10.1214/15-STS516. [DOI] [Google Scholar]
  4. Bevilacqua M, Alegria A, Velandia D, Porcu E. Composite likelihood inference for multivariate gaussian random fields. J Agricult, Biol and Environ Stat. 2016;21(3):448–469. doi: 10.1007/s13253-016-0256-3. [DOI] [Google Scholar]
  5. Bocquet M, Elbern H, Eskes H, Hirtl M, Zabkar R, Carmichael G, Flemming J, Inness A, Pagowski M, Pérez Camaño J, et al. Data assimilation in atmospheric chemistry models: current status and future prospects for coupled chemistry meteorology models. Atmosph Chem Phys. 2015;15(10):5325–5358. doi: 10.5194/acp-15-5325-2015. [DOI] [Google Scholar]
  6. Bonat WH, Jørgensen B. Multivariate covariance generalized linear models. J Royal Stat Soci: Series C (Appl Stat) 2016;65(5):649–675. [Google Scholar]
  7. Bourotte M, Allard D, Porcu E. A flexible class of non-separable cross-covariance functions for multivariate space-time data. Spat Stat. 2016;18:125–146. doi: 10.1016/j.spasta.2016.02.004. [DOI] [Google Scholar]
  8. Box GE, Jenkins GM, Reinsel GC, Ljung GM. Time series analysis: forecasting and control. New Jersey: Wiley; 2015. [Google Scholar]
  9. Bradley JR, Holan SH, Wikle CK. Multivariate spatio-temporal models for high-dimensional areal data with application to longitudinal employer-household dynamics. Ann Appl Stat. 2015;9(4):1761–1791. doi: 10.1214/15-AOAS862. [DOI] [Google Scholar]
  10. Briz-Redón Á, Iftimi A, Correcher JF, De Andrés J, Lozano M, Romero-García C. A comparison of multiple neighborhood matrix specifications for spatio-temporal model fitting: a case study on covid-19 data. Stochas Environ Res Risk Assess. 2022;36(1):271–282. doi: 10.1007/s00477-021-02077-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chan NH, Palma W. State space modeling of long-memory processes. Ann Stat. 1998;26(2):719–740. doi: 10.1214/aos/1028144856. [DOI] [Google Scholar]
  12. Cheng C, Sa-Ngasoongsong A, Beyca O, Le T, Yang H, Kong Z, Bukkapatnam ST. Time series forecasting for nonlinear and non-stationary processes: a review and comparative study. lie Trans. 2015;47(10):1053–1071. [Google Scholar]
  13. Christiansen R, Baumann M, Kuemmerle T, Mahecha MD, Peters J. Toward causal inference for spatio-temporal data: conflict and forest loss in colombia. J Am Stat Associat. 2022;117(538):591–601. doi: 10.1080/01621459.2021.2013241. [DOI] [Google Scholar]
  14. Dai Y-H. Convergence properties of the BFGS algoritm. SIAM J Optimiz. 2002;13(3):693–701. doi: 10.1137/S1052623401383455. [DOI] [Google Scholar]
  15. Daley DJ, Porcu E, Bevilacqua M. Classes of compactly supported covariance functions for multivariate random fields. Stoch Environ Resear Risk Assess. 2015;29(4):1249–1263. doi: 10.1007/s00477-014-0996-y. [DOI] [Google Scholar]
  16. Daniels MJ, Zhou Z, Zou H. Conditionally specified space-time models for multivariate processes. J Computat Graph Stat. 2006;15(1):157–177. doi: 10.1198/106186006X100434. [DOI] [Google Scholar]
  17. Ferreira G, Rodríguez A, Lagos B. Kalman filter estimation for a regression model with locally stationary errors. Comput Stat & Data Anal. 2013;62:52–69. doi: 10.1016/j.csda.2013.01.005. [DOI] [Google Scholar]
  18. Ferreira G, Mateu J, Porcu E. Spatio-temporal analysis with short- and long-memory dependence: a state-space approach. TEST. 2017;27(1):221–245. doi: 10.1007/s11749-017-0541-7. [DOI] [Google Scholar]
  19. Genton MG. Separable approximations of space-time covariance matrices. Environmet: The Off J Int Environ Soci. 2007;18(7):681–695. doi: 10.1002/env.854. [DOI] [Google Scholar]
  20. Genton MG, Kleiber W. Cross-covariance functions for multivariate geostatistics. Stat Sci. 2015;30(2):147–163. doi: 10.1214/14-STS487. [DOI] [Google Scholar]
  21. Gneiting T. Nonseparable, stationary covariance functions for space-time data. J Am Stat Associat. 2002;97(458):590–600. doi: 10.1198/016214502760047113. [DOI] [Google Scholar]
  22. Grassi S, de Magistris PS. When long memory meets the kalman filter: a comparative study. Computat Stat & Data Anal. 2014;76:301–319. doi: 10.1016/j.csda.2012.10.018. [DOI] [Google Scholar]
  23. Grewal MS. Kalman filtering. Berlin: Springer; 2011. [Google Scholar]
  24. Grewal MS, Weill LR, Andrews AP. Global positioning system, inertial navigation and integration. New Jersey: Wiley; 2001. [Google Scholar]
  25. Hamilton JD. Time series analysis. New Jersey: Princeton University Press; 2020. [Google Scholar]
  26. Huang H-C, Cressie N. Spatio-temporal prediction of snow water equivalent using the kalman filter. Computat Stat & Data Anal. 1996;22(2):159–175. doi: 10.1016/0167-9473(95)00047-X. [DOI] [Google Scholar]
  27. Kalman RE. A new approach to linear filtering and prediction problems. J Basic Eng. 1960;82(1):35–45. doi: 10.1115/1.3662552. [DOI] [Google Scholar]
  28. Kianfar N, Mesgari MS, Mollalo A, Kaveh M. Spatio-temporal modeling of covid-19 prevalence and mortality using artificial neural network algorithms. Spat Spatio-temp Epidemiol. 2022;40:100471. doi: 10.1016/j.sste.2021.100471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kim H-M, Ryu D, Mallick BK, Genton MG. Mixtures of skewed kalman filters. J Multivar Anal. 2014;123:228–251. doi: 10.1016/j.jmva.2013.09.002. [DOI] [Google Scholar]
  30. Lagos-Álvarez B, Padilla L, Mateu J, Ferreira G. A kalman filter method for estimation and prediction of space-time data with an autoregressive structure. J Stat Plann Infer. 2019;203:117–130. doi: 10.1016/j.jspi.2019.03.005. [DOI] [Google Scholar]
  31. Li Y, Qiu Y, Xu Y. From multivariate to functional data analysis: fundamentals, recent developments, and emerging areas. J Multiv Anal. 2022;188:104806. doi: 10.1016/j.jmva.2021.104806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mardia KV, Goodall C, Redfern EJ, Alonso FJ. The kriged kalman filter. Test. 1998;7(2):217–282. doi: 10.1007/BF02565111. [DOI] [Google Scholar]
  33. Mattera R. A weighted approach for spatio-temporal clustering of covid-19 spread in Italy. Spat Spatio-temp Epidemiol. 2022;41:100500. doi: 10.1016/j.sste.2022.100500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Maybeck PS. Square root filtering. Stoch Model, Eestimat Contr. 1979;1:368–409. [Google Scholar]
  35. Morana C. Multivariate modelling of long memory processes with common components. Computat Stat & Data Anal. 2007;52(2):919–934. doi: 10.1016/j.csda.2006.12.010. [DOI] [Google Scholar]
  36. Padilla L, Lagos-Álvarez B, Mateu J, Porcu E. Space-time autoregressive estimation and prediction with missing data based on kalman filtering. Environmetrics. 2020;31(7):e2627. doi: 10.1002/env.2627. [DOI] [Google Scholar]
  37. Paul SK, Jana S, Bhaumik P. A multivariate spatiotemporal model of covid-19 epidemic using ensemble of convlstm networks. J Instit Eng (India): Ser B. 2021;102(6):1137–1142. [Google Scholar]
  38. Porcu E, Furrer R, Nychka D. 30 years of space-time covariance functions. Computat Stat. 2019;13(12):e1512. [Google Scholar]
  39. Stein ML. Space-time covariance functions. J Am Stat Associat. 2005;100(469):310–321. doi: 10.1198/016214504000000854. [DOI] [Google Scholar]
  40. Stroud JR, Stein ML, Lesht BM, Schwab DJ, Beletsky D. An ensemble kalman filter and smoother for satellite data assimilation. J Am Stat Associat. 2010;105(491):978–990. doi: 10.1198/jasa.2010.ap07636. [DOI] [Google Scholar]
  41. Tsay W-J. Maximum likelihood estimation of stationary multivariate arfima processes. J Stat Comput Simul. 2010;80(7):729–745. doi: 10.1080/00949650902773536. [DOI] [Google Scholar]
  42. Vicente G, Goicoa T, Ugarte M. Bayesian inference in multivariate spatio-temporal areal models using inla: analysis of gender-based violence in small areas. Stoch Environ Resear Risk Assess. 2020;34(10):1421–1440. doi: 10.1007/s00477-020-01808-x. [DOI] [Google Scholar]
  43. Wang Y, Chaib-draa B. Knn-based kalman filter: an efficient and non-stationary method for gaussian process regression. Knowl-Based Sys. 2016;114:148–155. doi: 10.1016/j.knosys.2016.10.002. [DOI] [Google Scholar]
  44. Wikle CK. Hierarchical models in environmental science. Int Stat Rev. 2003;71(2):181–199. doi: 10.1111/j.1751-5823.2003.tb00192.x. [DOI] [Google Scholar]
  45. Xu K, Wikle CK. Estimation of parameterized spatio-temporal dynamic models. J Stat Plann Inf. 2007;137(2):567–588. doi: 10.1016/j.jspi.2005.12.005. [DOI] [Google Scholar]
  46. Zeng Q, Xu P, Wang X, Wen H, Hao W. Applying a bayesian multivariate spatio-temporal interaction model based approach to rank sites with promise using severity-weighted decision parameters. Accid Anal & Prevent. 2021;157:106190. doi: 10.1016/j.aap.2021.106190. [DOI] [PubMed] [Google Scholar]
  47. Zheng F, Zhu J. Balanced multivariate model errors of an intermediate coupled model for ensemble kalman filter data assimilation. J Geophys Res Oceans. 2008 doi: 10.1029/2007JC004621. [DOI] [Google Scholar]
  48. Zhu C, Byrd RH, Lu P, Nocedal J. Algorithm 778: L-bfgs-b: fortran subroutines for large-scale bound-constrained optimization. ACM Trans Math Softw (TOMS) 1997;23(4):550–560. doi: 10.1145/279232.279236. [DOI] [Google Scholar]
  49. Chaudhary A (2007) An introduction to the interface between c and r. Technical report
  50. Chen VY-J, Yang T-C, Jian H-L (2021) Geographically weighted regression modeling for multiple outcomes. Annals of the American Association of Geographers, pp 1–18
  51. Durbin J, Koopman SJ (2001) Time Series Analysis by State Space Methods, volume&nbsp;24 of Oxford Statistical Science Series. Oxford University Press, Oxford
  52. Matérn B (1986) Spatial variation. Lecture Notes in statistics, 36
  53. Padilla-Buriticá JI, Giraldo E, Castellanos-Domínguez G (2011) Eeg source localization based on multivariate autoregressive models using kalman filtering. In 2011 Annual international conference of the IEEE engineering in medicine and biology society, pp 7151–7154. IEEE [DOI] [PubMed]
  54. R Core Team (2015) R: a Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria
  55. Zes D (2014) Facile spacio-temporal modeling, forecasting with adaptive least squares and the kalman filter. J Environ Stat, 6(1)

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Stochastic Environmental Research and Risk Assessment are provided here courtesy of Nature Publishing Group

RESOURCES