Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2019 Jun 24;19(12):2813. doi: 10.3390/s19122813

Traffic Estimation for Large Urban Road Network with High Missing Data Ratio

Kennedy John Offor 1,*, Lubos Vaci 2, Lyudmila S Mihaylova 1
PMCID: PMC6631281  PMID: 31238533

Abstract

Intelligent transportation systems require the knowledge of current and forecasted traffic states for effective control of road networks. The actual traffic state has to be estimated as the existing sensors does not capture the needed state. Sensor measurements often contain missing or incomplete data as a result of communication issues, faulty sensors or cost leading to incomplete monitoring of the entire road network. This missing data poses challenges to traffic estimation approaches. In this work, a robust spatio-temporal traffic imputation approach capable of withstanding high missing data rate is presented. A particle based approach with Kriging interpolation is proposed. The performance of the particle based Kriging interpolation for different missing data ratios was investigated for a large road network comprising 1000 segments. Results indicate that the effect of missing data in a large road network can be mitigated by the Kriging interpolation within the particle filter framework.

Keywords: particle filtering, road traffic, state estimation, Bayesian inference, Kriging, missing data imputation

1. Introduction

Intelligent Transportation Systems (ITS) require accurate knowledge of traffic states for effective traffic monitoring and control. Traffic states are usually estimated from noisy sensor measurements [1] using various approaches, which can typically be subdivided into model based approaches, data driven approaches or streaming-data-driven [2]. An overview of the different modelling methodologies is given in Refs. [3,4]. These modelling methodologies include microscopic, macroscopic and mesoscopic approaches. Microscopic traffic models [3,4,5,6] describe the motion of each individual vehicle with a high level of detail. Hao et al. [7,8] recently proposed a model predictive control (MPC) termed urban cell transmission model (UCTM) for optimal switching of traffic lights at intersections based on the predicted traffic.

In macroscopic models [9,10], traffic state is represented by aggregating behaviour of the traffic, usually in terms of the average speed and the average density over a given period of time. Mesoscopic models [3] employ some features of microscopic and macroscopic approaches by utilising varying levels/degrees of detail to model traffic behaviour. This is achieved by modelling some locations with aggregated measurements as in macroscopic and the remaining locations are modelled down to the details of individual vehicles as is done in the case of microscopic.

Macroscopic models are sufficient to produce acceptable estimation accuracy when compared to the computational overhead of the microscopic models. Hence, they are the preferred choice for most practical purposes such as traffic control/management, road pricing and changes in infrastructure. Most traffic estimation approaches are model based [2], while the new trend is to develop data driven approaches [11].

Data driven methods rely on historical data or streaming/real-time data. Within the last decade, there has been growing interest in applying Kriging to traffic state prediction; for directional traffic volume using global position system (GPS) data [12], annual average traffic count interpolation using origin-destination data [13], estimating annual average daily traffic [14,15], traffic volume prediction [16,17], and traffic volume imputation [18].

Kriging is one of the most flexible data driven methods originally employed in geo-statistics or spatio-temporal analysis. Kriging exploits the spatial dependence (either covariance or variogram) using a weighted sum of observed data points to interpolate the values at locations of interest. It was initially used for copper mining by Krige [19] and was developed further by Matheron [20]. Since then, it has been applied in other fields such as spatial analysis and computer experiments. In the recent times, the method has been applied in traffic prediction [12].

One of the major challenges faced in traffic prediction is the issue of missing or sparse data. Traffic measurements are generally captured with different types of sensors and transmitted through communication infrastructure for processing and utilisation. These infrastructures are subject to failure and malfunction occasionally leading to incomplete/missing data, sometimes more than 40% [21]. The problem of sparse data is caused by the high cost of installing and managing traffic measurement devices making them impractical to cover all locations needed for effective observation of the full road network.

To address these challenges, researchers resorted to various methods and approaches such as missing data imputation [22], compressive sensing and historical averages [23], and Kriging interpolation [16]. In Ref. [24], a particle filter (PF) with the stochastic compositional model (SCM) is developed to estimate traffic states in motorways. Boundary measurements (inflow and outflow) were used to estimate the traffic state within the segments. The study reported that missing boundary measurements affected the estimated accuracy. A solution to the problem of missing data in particle filter measurement update was proposed in our previous work [25]. Here Kriging methods were used to interpolate the missing data which is subsequently used for the computation of the PF likelihood for traffic state estimation.

This work extends the approach of [25] for a larger road network of several kilometres with 1000 segments under the influence of missing data and/or sensor failure. The task of training such large segments would be resource intensive, hence the reduced measurement space, proposed in Ref. [26], was used to select the most influential segments in the road network. The drawback of Ref. [26] is that the most influential segments are selected based on only available measurements without considering any missing values. This would mean that if some interconnected segments have missing data, they would not be used leading to information loss. To address this, we propose a new approach, that combines the benefits of both approaches ([25,26]) in a single framework. First, the most probable segments are selected using the column based matrix decomposition (CBMD). Second, when there are missing measurements in the segments, Kriging is used to estimate the measurements before the measurement update step (see Section 5.3).

The present work differs from Refs. [12,16] in the following ways. Whereas [12,16] employed Kriging for predicting vehicular speed, our approach used Kriging to estimate missing vehicular flow and speed measurements which is then applied to the computation of measurement update step for particle filter estimation. In addition, our approach combines speed and vehicular flow estimation unlike the former which predicted only speed. The approaches used in Refs. [12,16] are similar, the only difference being that the former proposes the use of alternate distance metric called Approximate Road Distance Network (ARDN) to replace the Euclidean distance metric.

The contribution of this work is twofold: (i) A robust spatio-temporal traffic imputation that can withstand higher missing data proportion via information exchange between correlated segments. (ii) the use of Kriging and particle filter to address the effects of high missing/sparse data. A Column based matrix decomposition is used to select most influential segments in large road network and then imputing any missing measurements in the selected segments using Kriging.

The rest of the paper is organised as follows. Section 2 discusses some related work followed by the presentation of traffic flow and measurement model used in this work in Section 3. A background theory of random sets, covariance and variograms, which is the building block of Kriging is presented in Section 4. Recursive Bayesian estimation and particle filters (PF) are presented in Section 5. The proposed method was discussed in Section 5.3. Results and discussion are presented in Section 6.2 with conclusions being drawn in Section 7.

2. Related Work

A review of three different missing data imputation methods was presented by Ref. [27]. These include interpolation, prediction and statistical learning. The interpolation method imputes missing measurement at the particular location by averaging all historical measurements at that location at similar times of day. Prediction methods use a deterministic mathematical description to model the relationship between historical and future data. The statistical methods on the other hand treats the traffic as a random variable and tries to capture stochastic nature of the traffic pattern into the imputation algorithm.

In Refs. [28,29,30], a multi-resolution approximation using linear combinations of basis functions was proposed to address computational complexity of large datasets. The novelty in Ref. [28] lies in the use of multiple basis functions computed at lower resolutions closer to observation locations and then combining them to capture the different covariance functions with varying properties. The solution is achieved by dividing the spatial domain recursively into small regions and smaller sub-regions until the fine-scale dependencies are captured.

Nychka et al. [28] used radial basis functions (RBF) and special type of Gaussian Markov random field (GMRF) called spatial autoregressive (SAR) model to model the spatial correlation among the coefficients while Katzfuss et al. [30] automatically determines the appropriate basis/covariance function. Whereas in Ref. [29], it was assumed that the sub-domains are independent, Ref. [30] assumed depended sub-domains and performed full-scale approximation. As the computations are done locally in parallel, it is possible to fuse multisensor data sources, in which case, the different covariance functions are used for each data source. While both mentioned that the approach could be extended to non-stationary functions, it was not implemented nor was there a derivation for such. The different methods on missing traffic data imputation considered small road network in the range of a few kilometres.

3. Model Formulation

Consider a general discrete time state space system of a form,

xk+1=fk(xk)+vkorp(xk+1|xk), (1)
zk=hk(xk)+wkorp(zk|xk), (2)

where fk is a nonlinear function of the target state vector xk, hk represents a nonlinear relationship between sensor output zk and target state vector xk affected by a measurement noise wk. In addition, p(xk+1|xk) is the probability density function of the new state xk+1 given the previous state xk, and p(zk|xk) is the likelihood function of the measurement zk given the state xk.

3.1. Stochastic Compositional Traffic Flow Model

In this work, a stochastic compositional model (SCM) [31] is considered for modelling of a motorway/freeway vehicle traffic evolution expressed as p(xk+1|xk) in Equation (1). This extended cell-transition model incorporates traffic speed and uses forward and backward waves to describe the complex relationship between traffic behaviour especially for a large road network. SCM utilises sending and receiving functions which models the stochastic nature of traffic state evolution. The vehicles that are able to leave a cell are represented by receiving functions while those that are allowed to enter a cell are determined by the receiving functions. The receiving functions are usually less than or equal to the sending functions to obey the law of conservation of vehicles.

In the SCM, the road network is divided into a given number of cells, n, also called segments. Each segment has a length Li and number of lanes li as shown in Figure 1, where i=(1,,n). At any given time period k, a certain number of vehicles Qi,k crosses the boundary between two segments i and i+1. The number of vehicles in a given cell i within the same time period k is represented by Ni,k with their average speed given by vi,k.

Figure 1.

Figure 1

Stochastic compositional model (SCM) road network showing segments and measurement points [31].

The overall state vector at time k is given by xk=[x1,kT,x2,kT,,xn,kT]T where xi,k=[Ni,k,vi,k]T is the local state vector at segment i. Equations (3)–(5) models traffic state evolution within the cells.

x1,k+1=f1(Qkin,vkin,x1,k,x2,k,η1,k) (3)
xi,k+1=fi(xi1,k,xi,k,xi+1,k,ηi,k) (4)
xn,k+1=fn(xn1,k,xn,k,Qkout,vkout,ηn,k) (5)

where Qkin, called inflow represents the number of vehicles entering the first cell with average speed vkin, Qkout, called outflow represents the number of vehicles leaving the last segment with average speed vkout. The inflow and outflow vehicles with their average speeds are called the boundary conditions and are supplied to the model at the beginning.

3.2. Measurement Model

Consider the road network shown in Figure 1 divided into different segments with n boundaries. The traffic state at a boundary jJ=1,2,,n is sampled at discrete time steps ts, s=1,2,, to give zj,s=(Qj,s,vj,s)T. The measurements at all the boundaries are collected into a matrix given by Zs=(z1,sT,z2,sT,,zn,sT)T. The relationship between the sampling interval Δts and the state update time step Δtk Equation (1) is such that sampling interval is split into q state update time steps. That is, Δts=qΔtk. The measurement model and noise is represented by Equation (6) with error ξs = (ξQj,s,ξvj,s)T. With the assumption of Gaussian noise, zj,s can be expressed as:

zj,s=Qj,svj,s+ξs. (6)

4. Random Sets, Covariances and Variograms

4.1. Random Set

Let z(r) be a non-stationary set of m measurements z(r)=[z(r1),z(r2),,z(rn)]T, in our case average vehicle speeds v(r) or vehicle counts N(r), observed in segments i=1,,n at locations ri=(r1,r2,,rn), where ri=(rix,riz)T. Note that location of each sensor is uniquely defined by the segment topology (Figure 1). Then, according to Equation (7),

z(r)Nμ(r),K(r) (7)

the random set of variables z(r) can be approximated by a Gaussian process which is uniquely defined by the drift, i.e., the mean field μ(r)=E[z(r)], and the corresponding covariance K(rT,r)=z(r)μ(r)Tz(r)μ(r). Individual samples z(ri) of the set Equation (7) can be also expressed as

z(ri)=μ(ri)+δ(ri) (8)

with drift μ(ri) and residual δ(ri). Based on the degree to which the moments, in our case mean and variance of the random set z(r) are dependent (or independent) on a spatial relationship between points ri=(r1,r2,,rn), the 1st or the 2nd-order of stationarity can be recognised. While, the 1st-order stationary random set assumes a constant mean E[z(r)]=μ(r)=μ, the 2nd-order stationary random field assumes a linear drift between the increments E[z(r)]=μ(r)=n=0Nαnf(r). In the rest of this sequel, we assume that the random set z(r) is intrinsic and stationary on the 2nd-order.

4.2. Covariance

By assuming that random variables z(r) are 2nd-order stationary and isotropic, the covariance function K(ri,rj) in Equation (7) reads as follows:

K(ri,rj)=E(z(ri)μ)T(z(rj)μ),K(ri,rj)=E(z(ri)μ)T(z(ri+h)μ). (9)

The role of covariance function K(ri,rj) is to model the correlation between measurements z(ri) and z(rj) observed at locations ri and rj based on their separation distance called a lagh. Since the correlation between two random variables solely depends on their spatial distance, and not at all on their location, the lag can be conveniently expressed as an Euclidean l2 norm, defined as h=||rjri||2. Above statements can be summarised into the following equalities, i.e., the isotropy assumptions:

h=rjri||h||2=hK(ri,rj)=K(ri,ri+h)=K(h). (10)

For a straight stretch of motorway (Figure 1) this is equivalent to the path length through the road network.

The process of covariance function K(ri,rj) modelling requires to find a covariance curve that has the best fit to the empirical data Equation (9), possibly being a subject to constraints. The covariance models to choose from include exponential, spherical, Gaussian, linear or power model [14]. In this work, the best fit for the dataset was achieved by the exponential model depicted in Figure 2a and given by

K(ri,rj)=caif|rjri|=0,(ca)e3|rjri|b}if|rjri|>0, (11)

where a is the nugget, b is the range and c is still.

Figure 2.

Figure 2

Covariance and variogram models.

As can be observed in Figure 2a, observe that the covariance function decreases with distance, so it can be thought of as a similarity function.

5. Recursive Bayesian Estimation

5.1. Bayesian Estimation

Consider a general discrete time state space system represented as in Equations (1) and (2). The goal of Bayesian estimation is to infer the state variable xk as defined in Section 3.1 with the available sensor measurements z1:k. By using the Bayesian framework, this estimation problem relates to the recursive evaluation of the probability density function (PDF) p(xk|z1:k) in two consecutive steps, the prediction and the measurement update of the state vectors.

p(xk1|z1:k1)UpdatePredictionp(xk|z1:k1) (12)
p(xk|z1:k1)UpdateMeasurementp(xk|z1:k) (13)

The prediction state density p(xk|z1:k1) of state xk is calculated from the prior PDF p(xk1|z1:k1) by using Chapman-Kolmogorov equation

p(xk|z1:k1)=p(xk|xk1)p(xk1|z1:k1)dxk1. (14)

Equation (14) follows the 1st order Markov property which assumes that p(xk|z1:k1) only depends on state xk and xk1 at time k and k1 respectively. The measurement update p(xk|z1:k) is computed from the prior distribution of Equation (14) and measurements z1:k by a Bayesian formula which results in

p(xk|z1:k)=p(zk|xk)p(xk|z1:k1)p(zk|z1:k1) (15)

The 1st order Markov property for Equation (15) implies that p(xk|z1:k) only depends on measurement zk at time k.

5.2. Particle Filter

Arguably, the most popular algorithm for nonlinear recursive estimation is the particle filter (PF), extensively evaluated in Ref. [32]. PF represents any arbitrary probability density function p(xk|z1:k) by samples or particles xkl, where l=1,,Np is the number of particles, i.e.,

xklp(xk|z1:k), (16)

The particles are used to form an approximative distribution as

p(xk|z1:k)p^(xk|z1:k)=i=1Nwk|klδ(xkxkl), (17)

where p^(xk|z1:k) is an approximated distribution, δ(xkxkl) is a the Dirac delta function and wk|kl the weights of the particles satisfying l=1Npwk|k=1. The time update of the Bayesian recursion Equation (1) is in case of PF evaluated as

p(xk|z1:k1)p(xk|xk1)l=1Nwk1|k1lδ(xk1xk1l)dxk1,l=1Nwk1|k1lp(xk|xk1)δ(xk1xk1l)dxk1,l=1Nwk1|k1lp(xk|xk1). (18)

The particles xk1l in above Equation (18) are sampled from proposal distribution π(xk|xk1l), i.e., xk1lπ(xk|xk1l). Proposal distribution is very often defined by the state transition PDF, that is π(xk|xk1l)=p(xk|xk1i). In this case, the weights updates result to

wk|k1l=p(xk|xk1l)π(xk|xk1l,zk)wk1|k1l,=p(xk|xk1l)p(xk|xk1l)wk1|k1l=wk1|k1l. (19)

The measurement update p(xk|z1:k) (2) is computed by a Equation (15), which can be in terms of the particles xkl represented as

p(xk|z1:k)p(zk|xk)p(xk|z1:k1),l=1Npwk|k1lp(zk|xk)δ(xkxkl). (20)

Similarly, the particle filter weights are updated as

wk|kl=wk|k1lp(zk|xkl)l=1Npwk|k1lp(zk|xkl). (21)

The denominator in Equations (15) and (21) is only a normalising factor independent of xk thus can be safety omitted if the distribution is numerically normed as shown by Equations (20) and (21).

wk|klwk|k1lp(zk|xkl),wk|k1lp(zk|xkl). (22)

The MC recursion tempt to degrade over time as all relative weights would tend to zero except for one that tends to one. Therefore, when particle depletion ratio reaches 0.5 a Sampling Importance Resampling (SIR) or Sampling Importance Sampling (SIS) techniques are applied in the recursion.

5.3. Missing Measurement Interpolation and Improved Likelihood Computation

For a large road network with many segments, using all the measurements in the particle filter measurement update step becomes computationally intensive. Column based matrix decomposition approach similar (as earlier stated in Section 1) to the work of [26] is employed to select the most probable segments that would give acceptable accuracy.

The idea is to select m most influential segments from all available segments n using CBMD and then estimating missing measurements (if any) of the most influential segments using Kriging for improved particle likelihood computation. Let ZnRk×n represent a set of all segments or measurement locations in a given time period, where the rows, k is number of time instances at which the measurements were taken and columns n number of road segments.

The goal of CBMD is to approximate the measuremtns Zn with a subset of measurements ZmRk×m where m<n is a subset of the measurements using singular value decomposition (SVD) as Ref. [33]:

Zm=ZnΦ (23)

where ΦRn×m is the transformation matrix that expresses every column of all measurement Zn in terms of the basis in Zm. Having computed the SVD, the right singular matrix is used to assign a probability Pzi to each selected location according to:

Pzi=1rj=1rvi,j2,i=1,...,n (24)

where vi,j is the ith element of the jth right singular vector and r is the rank of the matrix. From the probabilities computed, m locations with the highest probabilities are chosen as the reduced measurement to approximate the entire network which is used in computing the particle filter likelihood during the measurement update step.

The likelihood function term p(zk|xk) in Equation (21), is computed when a new measurement arrives. The performance of the PF degrades substantially when there is a missing measurement. For the multivariate Gaussian distribution, the PDF is given by:

p(zk|xk)=12π|R|e0.5νR1νT, (25)

where R is the covariance matrix of the measurement data, |R|det(R) is the determinant of R and ν is the difference between the PF predicted value (z¯s) and measurement (zs), given by:

ν=zsz¯s. (26)

The measurement matrix zs can be expressed as,

zs=zsmeasmeasurement from sensor;z^skrigestimated by Kriging. (27)

where z^skrig represents the value estimated by Kriging (when measurement is not available). Note that the sampling time index ts is split into q update time indices tk as mentioned in Section 3.2. The measurement update state of the PF is performed only when tkts. The modified particle method is presented in Algorithm 1.

Algorithm 1 Particle Filter for Traffic State Estimation with Kriging Estimated Measurements [24]
  1. Road network approximation

    Use compressed sensing to select m most significant locations out of the n segments to be used for the measurement update step as defined in Section 5.3.

  2. Initialisation

    At k=0; define all boundary conditions: number of samples, weight of samples as below,

    For l=1,Np, Np number of particles;
    • generate Np samples {x0(l)} from the initial distribution p(x0)
    • initialise the particle weights w0(l)=1Np.

    End for

  3. Start the iteration for k=1,2,
    • (a)
      Prediction stage
      For l=1,...,Np,
      sample xk(l)p(xk|xk1(l)) according to SCM model equations
      End for
    • (b)
      Measurement Update:
      This step is performed when the sampling time ts equals the iteration count tk as defined in Section 3.2
      i. Estimate missing measurements in the m most significant locations with Kriging using Equations (28) and (33)
      ii. Compute the likelihoods
      Based on Equation (6) compute the likelihood, p(zs|xs(l)) of the particles using Equations (25)–(27)
      iii. Update the weights of the particles using the likelihood p(zs|xs(i)) calculated from Equation (6)
      For l=1,...,Np
      ωs(l)=ωs1(l)p(zs|xs(l))
      End For
      iv. Normalise the weights: ω^s(l)=ωs(l)l=1Npωs(l).
    • (c)
      Update the predicted states (Output): x^s=l=1Npω^s(l)xs(l)
    • (d)
      Re-sample the weights (Selection) only when tk = ts

5.4. Missing Data Estimation via Kriging Models

The underlying idea of Kriging is to obtain the response z(ru), interpreted as a random variable positioned at the location ru, by interpolating random variables z(r)=[z(r1),z(r2),,z(rn)]T from Equation (7), i.e., observations z(ri), i=1,,n at locations ri. The Kriging predictor incorporates the covariance structure among the observation points z(ri) into the weights w(ru)=[w1(ru),w2(ru),,wn(ru)]T for predicting z^(ru) as a linear combination:

z^(ru)=i=1nwi(ru)z(ri)=wT(ru)z(r). (28)

In order to assess the accuracy of the Kriging prediction z^(ru) w.r.t the real (true) value z(ru) an error ϵ(ru) is declared

ϵ(ru)=z(ru)z^(ru). (29)

The following criteria, evaluated in terms of mean Eϵ(ru) and variance Varϵ(ru)=σϵ2(ru) of the prediction error ϵ(ru) Equation (29), apply to any type of Kriging interpolation:

  • Lack of bias, implies that Eϵ(ru)=0 and therefore Ez^(ru)=Ez(ru). Thus, the Kriging interpolation is said to be the globally unbiased Equation (30).
    E[z(ru)z^(ru)]=0 (30)
  • Minimum variance, implies that the mean of the squared deviations Var[ϵ(ru)]=σϵ2(ru) must be the minimal Equation (31),
    min1Tw=1Var[z(ru)z^(ru)]ormin1Tw=1σϵ2(ru), (31)
    subject to
    i=1mwi=1Tw=1 (32)

Kriging weights w(ru) are chosen such that the mean squared prediction error σϵ2(ru) Equation (31), also known as Kriging variance or Kriging error is minimised as min1Tw=1σϵ2(ru) over all z(ru) Equation (31) subject to the unbiased conditions of Equations (30) and (32) by a Lagrange multiplier 2λ to give,

w(ru)=K(ri,rj)1K(ri,ru)μ1. (33)

where matrix K(ri,rj) is the covariance between the individual samples and column vector k(ri,ru) is a covariance function between samples and the point to interpolate.

6. Performance Evaluation

A road network with 1000 segments was simulated using SUMO software [34] to validate the proposed method. The segments are spaced 0.5 km apart and measurements (number of vehicles crossing each segment boundary with their average speed) taken every second. Traffic signs were installed at some locations to model the effect of congestion.

This is an extension of the previous work [25] where a smaller number segments was considered. The aggregate traffic flow and speed were sampled every 60 s and the results collected over a period of 10,800 s (3 h). Two types of vehicles, bus and passenger car was defined with the parameters as in Table 1.

Table 1.

SUMO simulation parameters.

Car Bus
Max speed 25 m/s 20 m/s
Acceleration 1.0 m/s2 0.8 m/s2
Deceleration 4.5 m/s2 4.5 m/s2
Sigma (driver perfection) 0.5 0.5
Length 5 m 10 m
Minimum Separation 2.5 m 3 m

The vehicles were added randomly into the network through the inflow boundary every one second and they travel through the network until they get to the last boundary when they leave the network. As a vehicle crosses each induction loop, it is counted with its speed. The average speed of the vehicles arriving at an induction loop over a period is recorded as the average speed. The entire statistics, flow, occupancy, and speed are collected in an output file for further processing.

6.1. Simulation Design

To simulate different scenarios such as congestion and free flow, (i) the number of lanes were decreased from 3 to 2, and (ii) the rate of vehicle injection into the network is varied at different time periods. Figure 3 and Figure 4 show the spatio-temporal evolution of the traffic and their corresponding average speed, respectively, for a 100 segment section. The average speed of the vehicles varies around 100 km/h when the flow was around 2000 veh/h. Between time interval [1.5 h, 1.7 h], the flow was increased slightly to cause congestion, this resulted to a decrease in the average speed as can be seen in the first spike from Figure 4.

Figure 3.

Figure 3

Spatio-temporal evolution of traffic flow for the 100 segment.

Figure 4.

Figure 4

Spatio-temporal evolution of traffic speed for the 100 segment.

Observe that the effect was felt more close to the inflow boundary. The vehicles’ speed increases marginally as they move into the network. Between time interval [1.6 h, 1.9 h] the flow was decreased leading to increase in the average speed. Finally, the number of lanes in segments 10 to 14 were reduced from 3 to 2 between time interval [2.4 h, 2.4 h] while maintaining vehicle injection rate. This results to substantive decrease in speed as can be seen (cone-shaped) in Figure 4. As the vehicles leave the segments with closed lanes, there is increase in their speed again.

6.2. Results and Discussion

In order to test the prediction accuracy for different levels of sparsity, a statistical measure namely the root mean squared error (RMSE), was computed. Note, zi is the ground truth or actual measurement, z^i is the estimated value and mr is number of independent Monte Carlo runs. The measurements at some boundaries were randomly removed each time and then estimated using the proposed method. Different missing data rates (from 10, 20, …, 70%) were investigated by randomly removing measurements at some locations using leave one out cross validation. This was repeated for 100 Monte Carlo runs and the average value used.

RMSE=1mri=1mr[ziz^i]2. (34)

Figure 5 and Figure 6 show the average estimation error for the different missing data ratios and the number of segments. It would be observed that the prediction error increases with the missing data rate. This is expected as less data is available for the computation. This effect could be reduced further by incorporating a mechanism known as multi resolution approach [30].

Figure 5.

Figure 5

RMSE of speed at different missing data ratios.

Figure 6.

Figure 6

RMSE of flow at different missing data ratios.

The plots also show that the higher the number of segments used, the better the accuracy. This could be attributed to the fact that there is better information exchange within the network and hence, on average, more segments with available data are used for the higher dimensional segments scenario. This is in agreement with [35] where estimation accuracy in the presence of sparse sensor data was improved by exchanging particle weights between segments. For instance, a 10 segment network with 70% missing data ratio will result to using only 3 data points to estimate the remaining 7 missing locations. The chances of these 3 locations correlating with the other 7 is lower compared to when 30 out of 100 data points are available.

Figure 7 and Figure 8 show the spatio-temporal evolution of the traffic flow and the corresponding average speed for the 100 segment scenario.

Figure 7.

Figure 7

Estimated flow for the 100 segment with 30% missing data.

Figure 8.

Figure 8

Estimated speed for the 100 segment with 30% missing data.

The number of vehicles crossing each boundary in space and their associated average speed is represented by the colour bar.Compared to the ground truth shown in Figure 3 and Figure 4, it is evident that estimated number of vehicles crossing segment boundaries and the associated speeds have been estimated with a good accuracy. Observe also that the estimated flow and speed captured the periods where there is drop in number of vehicles and decrease in speed.

7. Conclusions

This paper presented a traffic estimation for a large road network with different missing data ratios. The computational overhead of the large network was addressed by using a method called reduced measurement space proposed by [26] to select the most influential and information rich segments in the road network. These are subsequently used in the particle filter measurement update step. Missing data in the selected segments are imputed using Kriring. A 1000-segment road network was simulated using SUMO. Different missing data ratios ranging from 10% to 70% were tested for different sizes of road network ranging from 100 to 1000 segments.

The results indicate that considering a larger number of segments would reduce the overall estimation error even when the missing data ration is high. From the foregoing results and discussion, it is recommended that the best estimation accuracy would be obtained when the entire road network is considered at once. The effects of computational overhead could further be reduced by using a distributed approach with a central control unit.

Author Contributions

Ideas/conceptualisation, L.S.M., K.J.O.; methodology, L.S.M., K.J.O. and L.V.; software: particle filter/traffic models, L.S.M.; Kriging/sparsity, K.J.O.; implementation, K.J.O., L.S.M. and L.V.; validation, K.J.O., L.V.; formal analysis, K.J.O., L.S.M. and L.V.; investigation, K.J.O.; writing—original draft preparation, K.J.O.; writing—review and editing, L.V., L.S.M., K.J.O.; visualisation, K.J.O.

Funding

The authors acknowledge the support from the SETA project funded from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 688082 and the Tertiary Education Trust Fund (TETFund, Nigeria). The APC was funded by Department of Automatic Control and Systems Engineering, University of Sheffield.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  • 1.Ferrara A., Sacone S., Siri S. Freeway Traffic Modelling and Control. Advances in Industrial Control. Springer International Publishing; Cham, Switzerland: 2018. State estimation in freeway traffic systems; pp. 169–190. [Google Scholar]
  • 2.Seo T., Bayen A.M., Kusakabe T., Asakura Y. Traffic state estimation on highway: A comprehensive survey. Annu. Rev. Control. 2017;43:128–151. doi: 10.1016/j.arcontrol.2017.03.005. [DOI] [Google Scholar]
  • 3.Hoogendoorn S.P., Bovy P.H.L. Generic gas-kinetic traffic systems modeling with applications to vehicular traffic flow. Transp. Res. Part B. 2001;35:317–336. doi: 10.1016/S0191-2615(99)00053-3. [DOI] [Google Scholar]
  • 4.Papageorgiou M., Ben-Akiva M., Bottom J., Bovy P.H.L., Hoogendoorn S.P., Hounsell N.B., Kotsialos A., Mcdonald M. Handbooks in Operations Research and Management Science. Volume 14 Elsevier; Amsterdam, The Netrerlands: 2007. ITS and traffic management. Chapter 11. [Google Scholar]
  • 5.Pipes L.A. An operational analysis of traffic dynamics. J. Appl. Phys. 1953;24:274–281. doi: 10.1063/1.1721265. [DOI] [Google Scholar]
  • 6.Gipps P.G. A behavioural car-following model for computer simulation. Transp. Res. Part B Methodol. 1981;15:105–111. doi: 10.1016/0191-2615(81)90037-0. [DOI] [Google Scholar]
  • 7.Hao Z., Boel R., Li Z. Model based urban traffic control, part I: Local model and local model predictive controllers. Transp. Res. Part C Emerg. Technol. 2018;97:61–81. doi: 10.1016/j.trc.2018.09.026. [DOI] [Google Scholar]
  • 8.Hao Z., Boel R., Li Z. Model based urban traffic control, part II: Local model and local model predictive controllers. Transp. Res. Part C Emerg. Technol. 2018;97:23–44. doi: 10.1016/j.trc.2018.09.025. [DOI] [Google Scholar]
  • 9.Messmer A., Papageorgiou M. METANET: A macroscopic simulation program for motorway networks. Traffic Eng. Control. 1990;31:466–470. [Google Scholar]
  • 10.Daganzo C.F. The cell transmission model: A dynamic representation of highway traffic consistent with the hydrodynamic theory. Transp. Res. Part B. 1994;28:269–287. doi: 10.1016/0191-2615(94)90002-7. [DOI] [Google Scholar]
  • 11.Lu H.-P., Sun Z.-Y., Qu W.-C. Big data-driven based real-time traffic flow state identification and prediction. Discret. Dyn. Nat. Soc. 2015;2015:1–11. doi: 10.1155/2015/284906. [DOI] [Google Scholar]
  • 12.Braxmeier H., Schmidt V., Spodarev E. Kriged road-traffic maps. In: Pilz J., editor. Interfacing Geostatistics and GIS. Springer; Heidelberg, Germany: 2009. pp. 105–119. Chapter 9. [Google Scholar]
  • 13.Lowry M. Spatial interpolation of traffic counts based on origin-destination centrality. J. Transp. Geogr. 2014;36:98–105. doi: 10.1016/j.jtrangeo.2014.03.007. [DOI] [Google Scholar]
  • 14.Wang X., Kockelman K.M., Murray W.J. Forecasting network data: Spatial interpolation of traffic counts from Texas data. Transp. Res. Rec. 2009;2105:100–108. doi: 10.3141/2105-13. [DOI] [Google Scholar]
  • 15.Gastaldi M., Gecchele G., Rossi R. Estimation of annual average daily traffic from one-week traffic counts. A combined ANN-Fuzzy approach. Transp. Res. Part C Emerg. Technol. 2014;47:86–99. doi: 10.1016/j.trc.2014.06.002. [DOI] [Google Scholar]
  • 16.Zou H., Yue Y., Li Q., Yeh O.G.A. An improved distance metric for the interpolation of link-based traffic data using kriging: A case study of a large-scale urban road network. Int. J. Geogr. Inf. Sci. 2012;26:667–689. doi: 10.1080/13658816.2011.609488. [DOI] [Google Scholar]
  • 17.Song Y., Wang X., Wright G., Thatcher D., Wu P., Felix P. Traffic volume prediction with segment-based regression Kriging and its implementation in assessing the impact of heavy vehicles. IEEE Trans. Intell. Transp. Syst. 2019;20:234–243. doi: 10.1109/TITS.2018.2805817. [DOI] [Google Scholar]
  • 18.Yang H., Yang J., Han L.D., Liu X., Pu L., Chin S.-M., Hwang H.-L. A Kriging based spatiotemporal approach for traffic volume data imputation. PLoS ONE. 2018;13:1–11. doi: 10.1371/journal.pone.0195957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Krige D.G. A statistical approach to some basic mine problems on the Witwatersrand. J. Chem. Metall. Soc. S. Min. Afr. 1951;52:119–139. [Google Scholar]
  • 20.Matheron G. Principles of geostatistics. Econ. Geol. 1963;58:1246–1266. doi: 10.2113/gsecongeo.58.8.1246. [DOI] [Google Scholar]
  • 21.Zhang Y. A comparative study of three multivariate short-term freeway traffic flow forecasting methods with missing data. J. Intell. Transp. Syst. 2016;20:205–218. doi: 10.1080/15472450.2016.1147813. [DOI] [Google Scholar]
  • 22.Ladino A., Kibangou Y., Fourati H., Canudas de Wit C. Travel time forecasting from clustered time series via optimal fusion strategy; Proceedings of the 2016 European Control Conference (ECC); Aalborg, Denmark. 29 June–1 July 2016. [Google Scholar]
  • 23.Hawes M., Amer H.M., Mihaylova L. Traffic state estimation via a particle filter with compressive sensing and historical traffic data; Proceedings of the 2016 19th International Conference on Information Fusion (FUSION); Heidelberg, Germany. 5–8 July 2016; pp. 735–742. [Google Scholar]
  • 24.Mihaylova L., Boel R., Hegyi A. Freeway traffic estimation within particle filtering framework. Automatica. 2007;43:290–300. doi: 10.1016/j.automatica.2006.08.023. [DOI] [Google Scholar]
  • 25.Offor K.J., Hawes M., Mihaylova L.S. Short term traffic flow prediction with particle methods in the presence of sparse data; Proceedings of the 2018 21st International Conference on Information Fusion (FUSION); Cambridge, UK. 10–13 July 2018; pp. 1205–1212. [Google Scholar]
  • 26.Hawes M., Amer H.M., Mihaylova L. Traffic state estimation via a particle filter over a reduced measurement space; Proceedings of the 2017 20th International Conference on Information Fusion (Fusion); Xi’an, China. 10–13 July 2017. [Google Scholar]
  • 27.Li Y., Li Z., Li L. Missing traffic data: Comparison of imputation methods. IET Intell. Transp. Syst. 2014;8:51–57. doi: 10.1049/iet-its.2013.0052. [DOI] [Google Scholar]
  • 28.Nychka D., Bandyopadhyay S., Hammerling D., Lindgren F. A multiresolution Gaussian process model for the analysis of large spatial datasets. J. Comput. Graph. Stat. 2015;24:579–599. doi: 10.1080/10618600.2014.914946. [DOI] [Google Scholar]
  • 29.Katzfuss M., Hammerling D. Parallel inference for massive distributed spatial data using low-rank models. Stat. Comput. 2017;27:363–375. doi: 10.1007/s11222-016-9627-4. [DOI] [Google Scholar]
  • 30.Katzfuss M. A multi-resolution approximation for massive spatial datasets. J. Am. Stat. Assoc. 2017;112:201–214. doi: 10.1080/01621459.2015.1123632. [DOI] [Google Scholar]
  • 31.Boel R., Mihaylova L. A compositional stochastic model for real-time freeway traffic simulation. Transp. Res. Part B Methodol. 2006;40:319–334. doi: 10.1016/j.trb.2005.05.001. [DOI] [Google Scholar]
  • 32.Gustafsson F. Particle filter theory and practice with positioning applications. IEEE Aerosp. Electron. Syst. Mag. 2010;25:53–82. doi: 10.1109/MAES.2010.5546308. [DOI] [Google Scholar]
  • 33.Mitrovic N., Asif M.T., Dauwels J., Jaillet P. Low-dimensional models for compressed sensing and prediction of large-scale traffic data. IEEE Trans. Intell. Transp. Syst. 2015;16:2949–2954. doi: 10.1109/TITS.2015.2411675. [DOI] [Google Scholar]
  • 34.Behrisch M., Bieker L., Erdmann J., Krajzewicz D. SUMO—Simulation of urban mobility—An overview; Proceedings of the SIMUL 2011, The Third International Conference on Advances in System Simulation; Barcelona, Spain. 23–28 Oktober 2011; pp. 63–68. [Google Scholar]
  • 35.Mihaylova L., Hegyi A., Gning A., Boel R.K. Parallelized particle and Gaussian sum particle filters for large-scale freeway traffic systems. IEEE Trans. Intell. Transp. Syst. 2012;13:36–48. doi: 10.1109/TITS.2011.2178833. [DOI] [Google Scholar]

Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES