Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2020 Mar 4;20(5):1412. doi: 10.3390/s20051412

Track-to-Track Association for Intelligent Vehicles by Preserving Local Track Geometry

Ke Zou 1,*, Hao Zhu 1, Allan De Freitas 2, Yongfu Li 1, Hamid Esmaeili Najafabadi 3
PMCID: PMC7085508  PMID: 32143498

Abstract

Track-to-track association (T2TA) is a challenging task in situational awareness in intelligent vehicles and surveillance systems. In this paper, the problem of track-to-track association with sensor bias (T2TASB) is considered. Traditional T2TASB algorithms only consider a statistical distance cost between local tracks from different sensors, without exploiting the geometric relationship between one track and its neighboring ones from each sensor. However, the relative geometry among neighboring local tracks is usually stable, at least for a while, and thus helpful in improving the T2TASB. In this paper, we propose a probabilistic method, called the local track geometry preservation (LTGP) algorithm, which takes advantage of the geometry of tracks. Assuming that the local tracks of one sensor are represented by Gaussian mixture model (GMM) centroids, the corresponding local tracks of the other sensor are fitted to those of the first sensor. In this regard, a geometrical descriptor connectivity matrix is constructed to exploit the relative geometry of these tracks. The track association problem is formulated as a maximum likelihood estimation problem with a local track geometry constraint, and an expectation–maximization (EM) algorithm is developed to find the solution. Simulation results demonstrate that the proposed methods offer better performance than the state-of-the-art methods.

Keywords: Gaussian mixture model, local track geometry, maximum likelihood estimation, sensor bias, track association

1. Introduction

Reliable situational awareness plays an essential role in intelligent vehicles and surveillance systems [1,2,3,4,5,6]. Typical intelligent vehicles employ various types of sensors, such as radio detection and ranging (radar), light detection and ranging (lidar), and video. The radar sensor determines the relative location and the radial velocity of objects by emitting radio signals. Radar measurements often consist of false alarm detections in addition to detections from real objects or targets while missing some target-originated detections. The lidar sensor uses laser light to detect objects. Compared to radar, it provides more detailed measurements at an increased cost. Video sensors are feature-rich with a wide field-of-view, but they are more sensitive to different illumination and weather conditions [1]. Since these sensors have different sensing capabilities, features, and accuracies, the use of multiple heterogeneous sensors can result in more reliable and multi-modal environment perception systems. Therefore, pedestrians, vehicles, and obstacles are typically detected and tracked using a multi-sensor system in intelligent vehicles [7,8,9]. A multi-sensor multi-target tracking module jointly estimates the states and the number of targets from sensor measurements in intelligent vehicles, and it can be broadly categorized as centralized or distributed. The advantage of the distributed tracking systems is that they can provide a degree of scalability and robustness not achievable by traditional centralized tracking systems [1].

Track-to-track association (T2TA) is a crucial task in distributed tracking to find the correspondence between local tracks from different sensors. It is commonly applied to combine the local tracks of a sensor with those of another sensor to form the global tracklist. For automotive applications, radar, lidar, and video sensors in environmental perception systems for intelligent vehicles use different coordinate systems and sampling frequencies. Therefore, a spatio-temporal calibration should be performed to align the detections from different sensors [1]. In practice, detection from radar, lidar, and video sensors cannot always be calibrated or aligned accurately [10]. Each sensor may cover a different part of the surveillance region with a detection probability of less than one. As a result, some local tracks from a sensor may not correspond to those of other sensors. The range, azimuth, and elevation biases of a radar sensor may lead to errors in the local tracks from that sensor. The relationship between radar sensor bias and local tracks is presented in Figure 1, where two radar sensors, A and B, and one target, T, are shown. The radar sensor bias leads to the reporting of the target T as tracks TA and TB by radar A and B, respectively. Note that if the biases in A and B radars are significant, the distance between TA and TB is correspondingly high. In this case, TA and TB is considered as originating from two different targets by the T2TA module. Therefore, T2TA in intelligent vehicles or surveillance systems suffers from many challenges, including missing detection and measurement bias. In this paper, the focus is on the problem of independent T2TA for each frame in the presence of missed detections and sensor bias.

Figure 1.

Figure 1

Relationship between sensor bias and local tracks.

To formulate the T2TA as an optimization problem, different statistical distances or metrics are proposed in literature [11,12,13,14,15,16,17,18,19,20,21,22,23,24]. In [11], a weighted statistical distance is proposed for T2TA with the assumption that the local estimation error of one sensor is independent of those of other sensors for the same target. In [13], the independence assumption is relaxed, and a modified statistical distance with dependent errors is developed for T2TA. In [19], three algorithms based on the squared Mahalanobis distance are investigated, and the nearest neighbor (NN) and global nearest neighbor (GNN) algorithms are applied to compute the distance between two tracks for T2TA. In [20], a likelihood function for T2TA from multiple sensors is derived, and the multidimensional assignment algorithm is employed to solve the optimal matching problem. State augmentation data, which combines the kinematic state information and the additional feature state information, is proposed to perform T2TA in [21]. In [25], a track association algorithm is proposed based on the permutation matrix to support the track-to-track multi-sensor data fusion for multiple targets in an autonomous driving system. It is worth noting that most T2TA algorithms in intelligent vehicles employ the conventional GNN algorithm [25,26,27,28].

Nevertheless, most of the above methods do not consider the presence of sensor bias. In reality, T2TA performance significantly degrades with sensor bias [29]. Literature addressing this problem can be roughly divided into batch and online approaches. The batch approach is an offline implementation that estimates the track association and sensor bias using all local tracks [30,31,32]. A joint sensor registration and track-to-track fusion method is derived using an equivalent measurement method in [30], while a pseudo-measurement approach is adopted to handle registration and track fusion simultaneously in [31]. In [32], a joint registration, data association, and fusion method in a distributed sensor network is formulated as a maximum likelihood (ML) optimization problem. An expectation-maximization (EM) algorithm is then proposed to perform the ML optimization, joint association, and bias removal through following an iterative strategy. However, these methods are susceptible to being trapped in local minima and have high computational costs. The online approach is a real-time implementation to perform track association with sensor bias. In [33], relative position information among neighboring tracks is analyzed, and a reference topology feature is derived for the absolute position information. An optimal sub-pattern assignment (OSPA) metric is also proposed to construct the association cost for T2TASB. In [34], the OSPA metric is modified by compensating for the relative azimuth bias. In [35], the T2TASB is formulated as a point set registration problem, and a coherent point drift (CPD) algorithm is proposed to perform T2TASB. In the CPD algorithm, local tracks of one sensor are represented by Gaussian mixture model (GMM) centroids [4,36], where local tracks of all sensors are fitted to those of a reference sensor. Still, the CPD algorithm only exploits the relationship among local tracks from different sensors, i.e., it does not utilize the relative geometric relationship between a local track and its neighbors from each sensor. The geometry among neighboring local tracks is usually stable at least for a while and thus helpful in improving the T2TASB. Here, the geometry is inspired by the idea that the relationship between a local track and its neighbors from different sensors could be preserved after the transformation. Hence, the geometry among neighboring local tracks is usually stable at least for a while and thus helpful in improving the T2TASB.

In this paper, the problem of independent T2TA for each frame in the presence of missed detections and sensor bias is considered. A probabilistic method, called the local track geometry preservation (LTGP) algorithm, is proposed to handle T2TASB. In the proposed method, the local tracks of one sensor are represented by GMM centroids, and the local tracks of the other sensor are fitted to those of the first using a nonlinear transformation function. The local track geometry with k-connected neighborhood is developed, and the T2TASB is formulated as an ML optimization problem with an EM algorithm being proposed to address it.

Different from other literature, the main contributions of this paper are as follows:

  1. The mathematical formulation for T2TASB is presented. Moreover, the local track geometry with k-connected neighborhood is derived to improve the robustness and accuracy of T2TASB. The proposed method extends the CPD method by considering the geometric relationship between neighboring tracks.

  2. An EM algorithm is proposed for T2TASB. The optimal T2TASB correspondence matrix and transformation function between local tracks are estimated simultaneously.

  3. The performance of the proposed method is validated by the experiments and computer simulations using the KITTI dataset.

This paper is organized as follows. The formulation of T2TASB is presented in Section 2. In Section 3, the EM algorithm is used to estimate the parameters in the proposed method. The performance of the proposed approach is evaluated using computer simulations and experiments on the KITTI dataset in Section 4 and Section 5, respectively. Finally, conclusions and future work are discussed in Section 6.

2. A New Method for T2TASB

In this section, the T2TASB problem is formulated and a new solution is proposed. Let Xks denote the local tracks from sensor s at time k, where Xks=x1,ksT,x2,ksT,xNks,ksTT. The  xi,ks denotes the i-th track state estimate and the corresponding covariance from sensor s at time k, where k=1,2,K, i=1,2,Nks with K and Nks being the total number of discrete time steps and the number of tracks at time k by sensor s, respectively. Here, two sensors are applied to find the global track states, i.e., s=1,2. The objective of the T2TASB algorithm is to find the correspondence between Xk1 and Xk2.

In [35], T2TASB is considered as a probability density estimation problem. In this paper, the relative geometry among neighboring local tracks from each sensor is proposed to formulate a maximum likelihood estimation problem with a local track geometrical constraint. Assuming that the local tracks of one sensor are represented by GMM centroids, the corresponding local tracks of the other sensor are fitted to those of the first sensor. Let xt,k1 be the t-th data and xl,k2 be the centroid of the l-th component. That is,

p(xt,k1)=l=1Nk2πt,lkN(xt,k1f(xl,k2),σk2ID, (1)

where N denotes the Gaussian distribution; σk2 denotes the equal isotropic covariance at time k; f denotes the nonrigid transformation; I is the identity matrix; D is the size of a local track vector, and πt,lk is the mixing coefficient at time k with lπt,lk=1. We introduce an indicator Zk=[z1k,z2k,zNk1k], where ztk is a 1×Nk1 binary vector with elements zt,lk for l=1,2,Nk2. The zt,lk satisfy zt,lk{0,1} and lzt,lk=1 conditions. That is, only one element in vector ztk is 1 while all other elements are 0. We have

p(ztkπk)=l=1Nk2πt,lkzt,lk, (2)
p(xt,k1xl,k2,σ2,ztk)=l=1Nk2N(xt,k1f(xl,k2),σk2ID)zt,lk, (3)

where πk={πt,lk}t=1,2,Nk1l=1,2,Nk2. Here, a distribution 1Nk1 with weight w is employed to represents the component of a target detected by sensor 1, but not detected by sensor 2. The relationship between the local track lists xt,k1 and xl,k2 can be given by

p(xt,k1)=w1Nk1+(1w)l=1Nk2πt,lkN(xt,k1f(xl,k2),σk2ID. (4)

The nonrigid transformation f aligns the local tracks, while some nonlinear functions might be employed to approximate it as well. More detail of the nonrigid transformation is given in Appendix A. Here, the displacement function is adopted as [35]

f(Xk2)=Xk2+GkWk, (5)

where Wk is an Nk2×D dimensional weight matrix of the Gaussian kernel; Gk denotes an Nk2×Nk2 Gaussian kernel matrix with elements gij=e12β(xi,k2xj,k2)T(xi,k2xj,k2); and, β denotes the width parameter in the smoothing Gaussian filter. To enforce the smoothness of transformation f, the constraint on the weight matrix Wk can be given by [35,37]:

E(Wk)=Tr(WkTGkWk), (6)

where Tr(.) denotes the trace of a matrix and superscript T denotes transposition. By ignoring the constants independent of {σk2} and {Wk}, the objective function of T2TASB can be written as

Q=D2t=1Nk1l=1Nk2q(zt,lk)logσk2+12σk2t=1Nk1l=1Nk2q(zt,lk)×xt,k1f(xl,k2)Txt,k1f(xl,k2)+αTr(WkTGkWk), (7)

where α controls the trade-off parameter and q(zt,lk) is used to denote p(zt,lk=1Xk1,Xk2).

Consider the membership probability πt,lk in (2), which is assumed to be the same for all components [35]. Here, πt,lk is initialized using the traditional nearest neighbor (NN) method [38] as follows:

(1) If track t from sensor 1 is associated with track l from sensor 2 at time k using NN assignment, we have

πt,ik=τifi=l1τNk21ifil, (8)

where 0τ1 is the confidence in association with the NN method.

(2) If track t from sensor 1 is not associated with any track from sensor 2 at time k using NN assignment, then the uniform membership probability is applied:

πt,lk=1Nk2l(1,2Nk2). (9)

The transformation f uses the relationship between local tracks from different sensors, but does not consider the relative geometry between one track and its neighbors from each sensor. The geometry is inspired by the idea that the relationship between a local track and its neighbors from different sensors could be preserved after the transformation, as depicted in Figure 2. To ensure an accurate T2TA, a geometrical constraint on the local tracks is proposed in this paper. A schematic illustration of the geometrical constraint is given in Figure 2.

Figure 2.

Figure 2

Schematic illustration of the geometrical constraint. (a) with local tracks from sensor 1 and 2, assign neighbors to each local track from its sensor, e.g., the four local tracks around xi2 (b) compute the weights L (c) perform the transformation f with the constraint that each local track xi2 be reconstructed by its neighbors with weights L after the transformation (d) align the Local tracks from sensor 1 and 2 after transformation f by maximizing the objective function.

We desire to preserve the geometry of tracks Xk2 after the nonrigid transformation f. Based on the Euclidean distance between each local track and its neighbors in Xk2, the M nearest neighbors of each local track in Xk2 are obtained. Then, each point in Xk2 is represented as a weighted linear combination of its M nearest neighbors. Let L={Llj}l=1:Nk2j=1:Nk2 be an Nk2×Nk2 weighted matrix. If track state xj,k2 does not belong to the M nearest neighbors of track state xl,k2, then Llj is set to 0. Here, matrix L is obtained by minimizing the following cost function:

e(L)=l=1Nk2xl,k2j=1Nk2Lljxj,k2, (10)

where the sum of each row of L is equal to 1. After the nonrigid transformation, the local track geometry can be preserved by minimizing the transformed cost function:

E(L)=l=1Nk2q(zt,lk)f(xl,k2)j=1Nk2Lljf(xj,k2)2=l=1Nk2q(zt,lk)xl,k2+Gk(l,.)Wj=1Nk2Lij(xj,k2+Gk(j,.)W)2, (11)

where Gk(i,.) is the i-th row of Gk. The objective function of T2TA with sensor bias in (7) is given by

Q1=Q+rE(L)=D2t=1Nk1l=1Nk2q(zt,lk)logσk2+12σk2t=1Nk1l=1Nk2q(zt,lk)×xt,k1f(xl,k2)Txt,k1f(xl,k2)+αTr(WkTGkWk)+γl=1Nk2q(zt,lk)xl,k2+Gk(l,.)Wj=1Nk2Lij(xj,k2+Gk(j,.)W)2, (12)

where γ controls the trade-off between Q and E(L).

3. EM Solution for the Proposed Method

Let Θ={Zk},{σk2},{Wk} be the unknown parameters. To obtain an ML estimate of Θ, the EM algorithm is applied here. There are two steps in the EM algorithm:

  • 1).

    E-step: EL(Θ,Θ(m))=Q1

  • 2).

    M-step: Θ(m+1)=maxEL(Θ,Θ(m)),

where m is the iteration number of the algorithm. The E-step calculates the conditional expectation using the current estimate Θ(m), whereas the M-step provides an updated estimation, Θ(m+1). The estimate of Θ is updated by iterating through these two steps while the complete data likelihood function is maximized.

3.1. E-Step

First, q(zt,lk) can be found using Bayes’ theorem as

q(zt,lk)=πt,lkN(xt,k1f(xl,k2),σk2ID)lπt,lkN(xt,k1f(xl,k2),σk2ID)+w1w×Nk2Nk1. (13)

3.2. M-Step

Then, EL(Θ,Θ(m)) is rewritten as

EL(Θ,Θ(m))=12σk2Tr(Xk1)Tdiag(Rk1)Xk12TrRkTXk1TXk2+GkWk+TrXk2+GkWkTdiag(RkT1)Xk2+GkWk+D2log(σk2)t=1Nk1l=1Nk2q(zt,lk)+αTr(WkTGkWk)+γTr(Xk2)TBkXk2+2γTr(Xk2)TBkGkWk+γTrWkTGkBkGkWk, (14)

where diag(.) indicates diagonal matrix; Rk is an Nk1×Nk2 matrix with elements q(zt,lk) for t=1,2,Nk1, l=1,2,Nk2+1; Bk=ILTdiag(RkT1)IL; 1 represents the all-one column vector of corresponding length; and, I means the identity matrix.

The estimates of σk2 and Wk are iteratively updated by solving the corresponding partial derivative of the expected log likelihood to zero. That is,

EL(Θ,Θ(m))σk2=12σk4Tr(Xk1)Tdiag(Rk1)Xk12TrRkTXk1TXk2+GkWk+TrXk2+GkWkTdiag(RkT1)Xk2+GkWk+Dt=1Nk1l=1Nk2q(zt,lk)2σk2=0. (15)

This results in

σk2=1Dt=1Nk1l=1Nk2q(zt,lk)Tr(Xk1)Tdiag(Rk1)Xk12TrRkTXk1TXk2+GkWk+TrXk2+GkWkTdiag(RkT1)Xk2+GkWk. (16)

Similarly,

EL(Θ,Θ(m))Wk=12σk22GkRkTXk1+2Gkdiag(RkT1)Xk2+2Gkdiag(RkT1)GkWk+2αGkWk+2γGkBkXk2+2γGkBkGkWk=0. (17)

Thus, Wk can be obtained by solving

diag(RkT1)Gk+2ασk2I+2σk2γBkGkWk=RkTXk1diag(RkT1)Xk22σk2γBkXk2. (18)

Here, Ck is used to denote the cost matrix of T2TASB at time k as an Nk1×(Nk2+1) matrix with (t,l) element Ck(t,l) for t=1,2,Nk1, l=1,2,Nk2+1 given by

Ck(t,l)=In(1w)πt,lkN(xt,k1f(xl,k2),σk2ID)ifl=1,2,,Nk2Inw1Nk1ifl=Nk2+1,

where Ck(t,Nk2+1) represents the cost of not making an assignment. The assignment of track t from sensor 1 to track l from sensor 2 can occur only if Ck(t,l)<Ck(t,Nk2+1) for t=1,2,Nk1, l=1,2,Nk2 with Ck(t,Nk2+1) being a gate. If that gate is violated, no assignment option is selected. The solution for the above assignment problem is computed using the Hungarian algorithm [39].

The proposed LTGP method for T2TASB is summarized in Algorithm 1.

Algorithm 1 Proposed LTGP method for T2TASB
Require:
     Local tracks Xk1 and Xk2, parameters w, α, β, γ, M.
  • 1:

    Initialize Wk=0, Rk=INk1×Nk2.

  • 2:

    Search for the M nearest neighbors for each local track in Xk2.

  • 3:

    Find L by minimizing (10).

  • 4:

    while not converged do

  • 5:

    E-step:

  • 6:

       1. Assign membership probability πt,lk using (8) or (9).

  • 7:

       2. Update q(zt,lk) using (13).

  • 8:

    M-step:

  • 9:

       1. Update σk2 based on (16).

  • 10:

       2. Update Wk by solving the linear system in (18).

  • 11:

    end while

Ensure:
     Transformed local track from sensor 2 is f(Xk2)=Xk2+GkWk.
     Association matrix for T2TASB is Ck(t,l) as in (3.2).

4. Computer Simulations

In this section, the performance of the proposed methods is evaluated using simulated data. Thirty targets following a discretized nearly constant velocity motion model [40] are tracked by multiple radar sensors. The initial target positions are randomly generated in the region 100km,100km×100km,100km. The initial velocities of these targets are chosen as [0.5km/s,0.2km/s]. The covariances of the process and measurement noise components are respectively set to diag(104km2,104km2/s2,104km2,104km2/s2), and diag(10−4 km2, 10−5 rad2), where the cross-covariance terms have been ignored in the former [40]. The clutter is generated uniformly over the surveillance region using a Poisson random variable with a mean of 30 at each time step. The sampling period of the measurements is 1s. The number of time steps is 100.

Two radar sensors are considered in the distributed sensor network. The biases in the two sensors are set to η1=[1km,0.017rad]T, and η2=[2km,0.034rad]T. The detection probabilities Pd of both radars are chosen as 0.95. Measurement-to-track association is performed at each sensor without considering the sensor bias. The local tracks from sensor 1 and sensor 2 are illustrated in Figure 3.

Figure 3.

Figure 3

(a) Local tracks from sensor 1 (b) Local tracks from sensor 2.

Parameter τ denotes the confidence in the association by the NN method. Parameter w denotes the initial assumption on the number of false targets detected by sensor 1, but not detected by sensor 2. Parameter β represents the width of the smoothing Gaussian filter in the nonlinear transformation function. Parameter M represents the number of nearest neighbors used in linear reconstruction to preserve the local track structure, while ρ is the parameter in the cross-covariance fusion. We set τ = 0.5, w = 0.2, β = 0.1, M = 10, and ρ = 0.4 throughout this paper.

Parameters α and γ represent the trade-off regularization terms. The ranges of these parameters were determined experimentally. The correct association probability Pc defined as the ratio of the correctly assigned tracks over the total number of tracks is employed as the primary metric for performance evaluation. The variation of Pc with regularization parameters α and γ at time step k = 50 is shown in Figure 4. It is observed that the proposed method performs best when α[5,7] and γ[10,20]. Here, we set α = 6, and γ = 15.

Figure 4.

Figure 4

Model selection of the regularization parameters α and γ. (a) α[0.5,50] and γ[5,100], (b) α[2,10] and γ[5,50].

The proposed LTGP algorithm is used for T2TASB, and the results at time step k = 50 are given in Figure 5, which illustrates that the local tracks from the two sensors are associated correctly by the proposed LTGP method.

Figure 5.

Figure 5

(a) Local tracks at time step k = 50 (b) Local tracks at time step k = 50 after transformation with proposed method.

The performance of the proposed method is demonstrated next relative to those of GNN without registration, the reference pattern-based algorithm [33], and the CPD algorithm [35]. All results are averaged over 50 Monte Carlo runs. The proposed method achieves the best performance, as illustrated in Figure 6. Compared to the reference pattern-based algorithm, the CPD algorithm improves the Pc by about 8%. The Pc of the proposed method has improved by 5% as compared with the CPD algorithm. Furthermore, the results for a scenario with varying detection probabilities and different numbers of targets are respectively illustrated in Figure 7 and Figure 8. It is observed that the proposed algorithm outperforms the other three benchmark algorithms. From Figure 7, the performance of GNN without registration, reference pattern-based algorithm, and CPD algorithm degrade rapidly with a decreased detection probability. From Figure 8, the performance of the proposed method is almost constant while increasing the number of targets. Moreover, the average Pc of the proposed method is improved by approximately 9% as compared with the CPD algorithm.

Figure 6.

Figure 6

Correct association probabilities of GNN without registration, reference pattern-based algorithm, CPD algorithm and the proposed method.

Figure 7.

Figure 7

Correct association probabilities of GNN without registration, reference pattern-based algorithm, CPD algorithm and the proposed method for different detection probabilities.

Figure 8.

Figure 8

Correct association probabilities of GNN without registration, reference pattern-based algorithm, correlation-based algorithm and the proposed method for a detection probabilities of Pd=0.9 when target cardinality changes within a fixed surveillance region.

The computational complexities of the proposed LTGP algorithm are analyzed next. For simplicity, the same number of local tracks N for each sensor every time is considered. At each time step in the LTGP algorithm, the computational complexity to search the M nearest neighbors for each local track in Xk2 is O((M+N)logN), using the k-d tree [41]; the computational complexity to obtain matrix L is O(M3N); and, the complexity of the EM algorithm is almost O(N3) [42]. The computational complexity at each time step in the LTGP algorithm is O(N3). Therefore, the total computational complexity of the proposed LTGP algorithm is O(N3K), where K is the total number of measurement samples.

5. Experiments on KITTI Dataset

In this section, we evaluate the proposed algorithm using the KITTI dataset. Here, the KITTI multi-object tracking dataset [43] is applied to evaluate the proposed data association method. The vehicle tracking test sequences 01 and 20, and pedestrian tracking test sequences 16 and 17 are used. Each sequence consists of 30 frames. Figure 9 depicts the starting frames of left and right cameras for each sequence along with the results of object detection. For the left camera, the detection results of vehicle or pedestrian are provided by the ground truth. Meanwhile, the deformable part model detector method [44] is proposed to detect vehicle or pedestrian for images of rthe ight camera.

Figure 9.

Figure 9

Typical frames of the KITTI dataset [43]. (a) Sequence 01 starting frame #34. (b) Sequence 20 starting frame #31. (c) Sequence 16 starting frame #63. (d) Sequence 17 starting frame #23.

The ground truth matching between the left and right images in each frame is confirmed by manual annotation. The GNN without registration, the reference pattern-based algorithm, the CPD algorithm, and the proposed method are employed to associate the local tracks. The average T2TA matching accuracy performances of different T2TA methods are depicted in Figure 10. It is confirmed that the performance of the proposed method is substantially better than those of GNN without registration, the reference pattern-based algorithm, and the CPD algorithm. Compared with the CPD algorithm, the average performance of the proposed method is improved by about 7.8%. In addition, since the KITTI sequence 17 contains large pedestrian occlusion while the motion is more than the other sequences, the performance gap between this and other sequences is more evident. The proposed LTGP method has better performance compared to three benchmark algorithms in the KITTI sequence 17. It is because the proposed method preserves the geometry of local tracks in the data association. The average run-times of these algorithms are given in Table 1, which reveals that the proposed LTGP method has higher computational complexity compared to the GNN without registration, reference pattern-based, and CPD methods.

Figure 10.

Figure 10

Track-to-track association matching accuracy of the GNN without registration, reference pattern-based algorithm, CPD algorithm and proposed LTGP method under different sequences of KITTI dataset.

Table 1.

The average run-times of these algorithms in four KITTI sequences

Method GNN without Registration (s) Reference Pattern-Based (s) CPD(s) LTGP(s)
Sequence
KITTI_01 0.0004 0.0008 0.0094 0.0159
KITTI_20 0.0004 0.0013 0.0102 0.0169
KITTI_16 0.0008 0.0015 0.0111 0.0176
KITTI_17 0.0004 0.0009 0.0104 0.0163

6. Conclusions

A probabilistic method for the track-to-track association, namely, LTGP, was proposed in this paper. In the LTGP method, one local track was transformed into another local track using a nonlinear function. We utilized k-connected neighbors to preserve the relative local track geometry. The T2TASB problem was formulated as a probability density estimation problem. The EM algorithm was used to fuse biased tracks from two sensors. To illustrate the advantages of the proposed method, some experiments of computer simulation and KITTI dataset were performed and the result is compared with GNN without registration, reference pattern-based algorithm, and CPD algorithm. Experiments on computer simulation involve varying detection probabilities and different numbers of targets, the proposed method has better performance than other algorithms for all detection probabilities and numbers of targets, but it has higher computational complexity. In the KITTI dataset, the proposed LTGP method has better performance than other methods. The T2TA matching accuracy of the proposed LTGP method was improved by about 7.8% as compared with the CPD method. From the experimental results of computer simulation and KITTI dataset, it can be concluded that the proposed LTGP method outperforms the GNN without registration algorithm, the reference pattern-based algorithm, and the CPD algorithm, but it has a higher computational load.

In the future, the proposed method is not restricted to the considered application but can be extended to other tasks, such as multi-sensor T2TASB for the connected vehicle. For the multi-sensor T2TASB scenario, the LTGP method can be extended using sequential processing.

Acknowledgments

The authors gratefully acknowledge the Autonomous Vision Group for providing the KITTI dataset. The authors also would like to thank the editors and referees for the valuable comments and suggestions.

Abbreviations

T2TA Track-to-track association
T2TASB Track-to-track association with sensor bias
LTGP Local track geometry preservation
GMM Gaussian mixture model
EM Expectation-maximization
NN Nearest neighbor
GNN Global nearest neighbor
ML Maximum likelihood
OSPA Optimal sub-pattern assignment
CPD Coherent point drift
Xks Local tracks from sensor s at time k
K Total number of discrete time steps
Nks Number of tracks at time k by sensor s
xt,k1 t-th data from sensor 1 at time k
xl,k2 Centroid of the l-th component from sensor 2 at time k
N Gaussian distribution
σk2 Equal isotropic covariance at time k
f Nonrigid transformation
I Identity matrix
D Size of a local track vector
πt,lk Membership probability of t-th row and l-th column element in πk at time k
πk Membership probability matrix at time k
Zk Indicator matrix
ztk a 1×Nk1 binary vector for l=1,2,Nk2 at time k
zt,lk t-th row and l-th column element in ztk at time k
Wk an Nk2×D dimensional weight matrix of the Gaussian kernel
Gk an Nk2×Nk2 Gaussian kernel matrix
gij an i-th row and j-th column element in Gk
β the width parameter in the smoothing Gaussian filter
Tr(.) Trace of a matrix
L Nk2×Nk2 weighted matrix
Llj a l-th row and j-th column element in L
Gk(i,.) i-th row of Gk
γ Trade-off parameter controlling between Q and E(L)
Rk an Nk1×Nk2 matrix
Ck Cost matrix of T2TASB at time k as an Nk1×(Nk2+1) matrix
[xt,k1,yt,k1] x-axis and y-axis positions of target xt,k1
[xj,k1,yj,k2] x-axis and y-axis positions of target xj,k2

Appendix A. The Relationship of Two Local Tracks

Assume that a target is detected by two sensors and that the two sensors are located at the coordinate origins. Due to the sensor bias, the target (x,y) is reported as xt,k1=[xt,k1,yt,k1] and xj,k2=[xj,k2,yj,k2] by the two sensors, respectively. By ignoring the random noises, we have [35]

xt,k1=(Υt,k1+ΔΥ1)cos(θt,k1+Δθ1), (A1)
yt,k1=(Υt,k1+ΔΥ1)sin(θt,k1+Δθ1), (A2)
xj,k2=(Υj,k2+ΔΥ2)cos(θj,k2+Δθ2), (A3)
yj,k2=(Υj,k2+ΔΥ2)sin(θj,k2+Δθ2), (A4)

where Υi,ks and θi,ks denote the real range and angle measurement of target i for sensor s, respectively. The ΔΥs and Δθs represent the range bias and angle bias, respectively. Therefore,

xt,k1×Υt,k1Υt,k1+ΔΥ1=xcosΔθ1ysinΔθ1, (A5)
yt,k1×Υt,k1Υt,k1+ΔΥ1=xsinΔθ1+ycosΔθ1, (A6)
xj,k2×Υj,k2Υj,k2+ΔΥ2=xcosΔθ2ysinΔθ2, (A7)
yj,k2×Υj,k2Υj,k2+ΔΥ2=xsinΔθ2+ycosΔθ2. (A8)

From (A5)–(A8), we get

xj,k2yj,k2=Υt,k1(Υj,k2+ΔΥ2)Υj,k2(Υt,k1+ΔΥ1)cosθ12sinθ12sinθ12cosθ12xt,k1yt,k1, (A9)

where θ12=Δθ1Δθ2. Equation (A9) gives the relationship between the local tracks xt,k1 and xj,k2. In this paper, a non-rigid transformation as (5) is proposed to approximate the relationship from one local tracks to other local tracks.

Author Contributions

Major framework for the review, K.Z., H.Z. and Y.L.; software, K.Z.; writing–original draft preparation, K.Z. and H.Z.; writing–review and editing, A.D.F.; supervision, H.E.N. All authors have read and agreed to the published version of the manuscript.

Funding

This work is jointly supported by the Research Funds of Chongqing Science and Technology Commission (Grant No. cstc2017jcyjAX0293, No. CSTC2017JCYJBX0018, and No. CX2017044), by the National Natural Science Foundation of China (Grant No. 61773082), by the Key Project of Crossing and Emerging Area of CQUPT (Grant No. A2018-02), by the Research Fund of young-backbone university teacher in Chongqing province, by Chongqing Overseas Scholars Innovation Program, by Wenfeng Talents of Chongqing University of Posts and Telecommunications, by Innovation Team Project of Chongqing Education Committee (Grant No. CXTDX201601019), by the National Key Research and Development Program (Grant No. 2016YFB0100906), by the Research and Innovation of Chongqing Postgraduate Project (Grant No. CYS19274), by the Lilong Innovation and Entrepreneurship Fund of Chongqing University of Posts and Telecommunications (Grant No. 2019-1-01).

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Zhu H., Yuen K., Mihaylova L., Leung H. Overview of Environment Perception for Intelligent Vehicles. IEEE Trans. Intell. Transp. Syst. 2017;18:2584–2601. doi: 10.1109/TITS.2017.2658662. [DOI] [Google Scholar]
  • 2.Li Y., Chen W., Peeta S., Wang Y. Platoon Control of Connected Multi-Vehicle Systems Under V2X Communications: Design and Experiments. IEEE Trans. Intell. Transp. Syst. 2019 doi: 10.1109/TITS.2019.2905039. [DOI] [Google Scholar]
  • 3.Zhu H., Zou K., Li Y., Cen M., Mihaylova L. Robust Non-Rigid Feature Matching for Image Registration Using Geometry Preserving. Sensors. 2019;19:2729. doi: 10.3390/s19122729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhu H., Guo B., Zou K., Li Y., Yuen K., Mihaylova L. A Review of Point Set Registration: From Pairwise Registration to Groupwise Registration. Sensors. 2019;19:1191. doi: 10.3390/s19051191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Li Y., Tang C., Peeta S., Wang Y. Integral-Sliding-Mode Braking Control for a Connected Vehicle Platoon: Theory and Application. IEEE Trans. Ind. Electron. 2019;66:4618–4628. doi: 10.1109/TIE.2018.2864708. [DOI] [Google Scholar]
  • 6.Li Y., Tang C., Peeta S., Wang Y. Nonlinear Consensus-Based Connected Vehicle Platoon Control Incorporating Car-Following Interactions and Heterogeneous Time Delays. IEEE Trans. Intell. Transp. Syst. 2019;20:2209–2219. doi: 10.1109/TITS.2018.2865546. [DOI] [Google Scholar]
  • 7.Kim D., Jo K., Lee M., Sunwoo M. L-Shape Model Switching-Based Precise Motion Tracking of Moving Vehicles Using Laser Scanners. IEEE Trans. Intell. Transp. Syst. 2018;19:598–612. doi: 10.1109/TITS.2017.2771820. [DOI] [Google Scholar]
  • 8.Ji X., Zhang G., Chen X., Guo Q. Multi-Perspective Tracking for Intelligent Vehicle. IEEE Trans. Intell. Transp. Syst. 2018;19:518–529. doi: 10.1109/TITS.2017.2784486. [DOI] [Google Scholar]
  • 9.Zhang Y., Song B., Du X., Guizani M. Vehicle Tracking Using Surveillance With Multimodal Data Fusion. IEEE Trans. Intell. Transp. Syst. 2018;19:2353–2361. doi: 10.1109/TITS.2017.2787101. [DOI] [Google Scholar]
  • 10.Alessandretti G., Broggi A., Cerri P. Vehicle and Guard Rail Detection Using Radar and Vision Data Fusion. IEEE Trans. Intell. Transp. Syst. 2007;8:95–105. doi: 10.1109/TITS.2006.888597. [DOI] [Google Scholar]
  • 11.Singer R., Kanyuck A. Computer control of multiple site track correlation. Automatica. 1971;7:455–463. doi: 10.1016/0005-1098(71)90096-3. [DOI] [Google Scholar]
  • 12.Sorenson H.W. An efficient track association algorithm for the multitarget tracking problem. Inf. Sci. 1980;21:241–260. doi: 10.1016/0020-0255(80)90033-X. [DOI] [Google Scholar]
  • 13.Bar-Shalom Y. On the track-to-track correlation problem. IEEE Trans. Autom. Control. 1981;26:571–572. doi: 10.1109/TAC.1981.1102635. [DOI] [Google Scholar]
  • 14.He Y., Zhang J. New track correlation algorithms in a multisensor data fusion system. IEEE Trans. Aerosp. Electron. Syst. 2006;42:1359–1371. [Google Scholar]
  • 15.Chong C.Y., Mori S. Metrics for Feature-Aided Track Association; Proceedings of the 9th International Conference on Information Fusion; Florence, Italy. 10–13 July 2006; pp. 1–8. [Google Scholar]
  • 16.Mori S., Chang K.C., Chong C.Y. Performance prediction of feature-aided track-to-track association. IEEE Trans. Aerosp. Electron. Syst. 2014;50:2593–2603. doi: 10.1109/TAES.2014.120687. [DOI] [Google Scholar]
  • 17.Kaplan L.M., Bar-Shalom Y., Blair W.D. Assignment costs for multiple sensor track-to-track association. IEEE Trans. Aerosp. Electron. Syst. 2008;44:655–677. doi: 10.1109/TAES.2008.4560213. [DOI] [Google Scholar]
  • 18.Kragel B., Herman S., Roseveare N. A Comparison of Methods for Estimating Track-to-Track Assignment Probabilities. IEEE Trans. Aerosp. Electron. Syst. 2012;48:1870–1888. doi: 10.1109/TAES.2012.6237567. [DOI] [Google Scholar]
  • 19.Liu X., Yin H., Liu H.Y., Wu Z.M. Performance of track-to-track association algorithms based on Mahalanobis distance. Sens. Transducers. 2013;22:58–65. [Google Scholar]
  • 20.Bar-Shalom Y., Chen H. Multisensor track-to-track association for tracks with dependent errors; Proceedings of the 2004 IEEE Conference on Decision and Control; Nassau, Bahamas. 14–17 December 2004; pp. 2674–2679. [Google Scholar]
  • 21.Osbome R.W., Bar-Shalom Y., Willett P. Track-to-track association with augmented state; Proceedings of the 2011 International Conference on Information Fusion; Chicago, IL, USA. 5–8 July 2011; pp. 1–8. [Google Scholar]
  • 22.Bar-Shalom Y., Chen H. Track-to-Track Association Using Attributes. J. Adv. Inf. Fusion. 2007;2:49–59. [Google Scholar]
  • 23.Song T.L., Musicki D. Smoothing innovations and data association with IPDA. Automatica. 2012;48:1324–1329. doi: 10.1016/j.automatica.2012.04.001. [DOI] [Google Scholar]
  • 24.Dencux T., Zoghby N.E., Cherfaoui V., Jouglet A. Optimal Object Association in the Dempster-Shafer Framework. IEEE Trans. Cybern. 2014;44:2521–2531. doi: 10.1109/TCYB.2014.2309632. [DOI] [PubMed] [Google Scholar]
  • 25.Lee K.H., Kanzawa Y., Derry M., Jamesy M.R. Multi-Target Track-to-Track Fusion Based on Permutation Matrix Track Association; Proceedings of the 2018 IEEE Conference on Intelligent Vehicles Symposium; Changshu, China. 26–30 July 2018; pp. 465–470. [Google Scholar]
  • 26.Zhao H., Sha J., Zhao Y., Xi J., Cui J., Zha H., Shibasaki R. Detection and Tracking of Moving Objects at Intersections Using a Network of Laser Scanners. IEEE Trans. Intell. Transp. Syst. 2012;13:655–670. doi: 10.1109/TITS.2011.2175218. [DOI] [Google Scholar]
  • 27.Wang X., Xu L., Sun H., Xin J., Zheng N. On-Road Vehicle Detection and Tracking Using MMW Radar and Monovision Fusion. IEEE Trans. Intell. Transp. Syst. 2016;17:2075–2084. doi: 10.1109/TITS.2016.2533542. [DOI] [Google Scholar]
  • 28.Jo K., Lee M., Sunwoo M. Track Fusion and Behavioral Reasoning for Moving Vehicles Based on Curvilinear Coordinates of Roadway Geometries. IEEE Trans. Intell. Transp. Syst. 2018;19:3068–3075. doi: 10.1109/TITS.2017.2759904. [DOI] [Google Scholar]
  • 29.Papageorgiou D.J., Sergi J.D. Simultaneous Track-to-Track Association and Bias Removal Using Multistart Local Search; Proceedings of the IEEE Aerospace Confence; Big Sky, MT, USA. 1–8 March 2008; pp. 1–14. [Google Scholar]
  • 30.Okello N.N., Challa S. Joint sensor registration and track-to-track fusion for distributed tracks. IEEE Trans. Aerosp. Electron. Syst. 2004;40:808–823. doi: 10.1109/TAES.2004.1337456. [DOI] [Google Scholar]
  • 31.Huang D., Leung H., Bosse E. A Pseudo-Measurement Approach to Simultaneous Registration and Track Fusion. IEEE Trans. Aerosp. Electron. Syst. 2012;48:2315–2331. doi: 10.1109/TAES.2012.6237594. [DOI] [Google Scholar]
  • 32.Zhu H., Leung H., Yuen K.V. A joint data association, registration, and fusion approach for distributed tracking. Inf. Sci. 2015;324:186–196. doi: 10.1016/j.ins.2015.06.042. [DOI] [Google Scholar]
  • 33.Tian W., Wang Y., Shan X., Yang J. Track-to-Track Association for Biased Data Based on the Reference Topology Feature. IEEE Signal Process. Lett. 2014;21:449–453. doi: 10.1109/LSP.2014.2305305. [DOI] [Google Scholar]
  • 34.Tian W. Reference pattern-based track-to-track association with biased data. IEEE Trans. Aerosp. Electron. Syst. 2016;52:501–512. doi: 10.1109/TAES.2015.140433. [DOI] [Google Scholar]
  • 35.Zhu H., Wang M., Yuen K.V., Leung H. Track-to-Track Association by Coherent Point Drift. IEEE Signal Process. Lett. 2017;24:643–647. doi: 10.1109/LSP.2017.2682857. [DOI] [Google Scholar]
  • 36.Wan Y., Puig V., Ocampo-Martinez C., Wang Y., Harinath E., Braatz R.D. Fault detection for uncertain LPV systems using probabilistic set-membership parity relation. J. Process. Control. 2020;87:27–36. doi: 10.1016/j.jprocont.2019.12.010. [DOI] [Google Scholar]
  • 37.Myronenko A., Song X. Point Set Registration: Coherent Point Drift. IEEE Trans. Pattern Anal. Mach. Intell. 2010;32:2262–2275. doi: 10.1109/TPAMI.2010.46. [DOI] [PubMed] [Google Scholar]
  • 38.Singer R.A., Stein J.J. An optimal tracking filter for processing sensor data of imprecisely determined origin in surveillance systems; Proceedings of the 1971 IEEE Conference on Decision and Control; Miami Beach, FL, USA. 15–17 December 1971; pp. 171–175. [Google Scholar]
  • 39.Kuhn H.W. The Hungarian Method for the Assignment Problem. Nav. Res. Logist. 1995;2:83–97. doi: 10.1002/nav.3800020109. [DOI] [Google Scholar]
  • 40.Bar-Shalom Y., Li X., Kirubarajan T. Estimation, Tracking and Navigation: Theory, Algorithms and Software. John Wiley & Sons; New York, NY, USA: 2001. [Google Scholar]
  • 41.Bentley J.L. Multidimensional Binary Search Trees Used for Associative Searching. Commun. ACM. 1975;18:509–517. doi: 10.1145/361002.361007. [DOI] [Google Scholar]
  • 42.Zhu H., Leung H., He Z. State Estimation in Unknown Non-Gaussian Measurement Noise using Variational Bayesian Technique. IEEE Trans. Aerosp. Electron. Syst. 2013;49:2601–2614. doi: 10.1109/TAES.2013.6621839. [DOI] [Google Scholar]
  • 43.Geiger A. Ph.D. Thesis. University of Tübingen; Tübingen, Germany: 2013. Probabilistic Models for 3D Urban Scene Understanding from Movable Platforms. [Google Scholar]
  • 44.Felzenszwalb P.F., Girshick R.B., McAllester D., Ramanan D. Object Detection with Discriminatively Trained Part-Based Models. IEEE Trans. Pattern Anal. Mach. Intell. 2010;32:1627–1645. doi: 10.1109/TPAMI.2009.167. [DOI] [PubMed] [Google Scholar]

Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES