Abstract
In recent years, visual object tracking has drawn increasing interest. Its applications include video surveillance in airports, schools, hospitals, and traffic, where tracking can provide crucial information about the behavior, interaction, and relationships of objects of interest. This paper addresses object tracking in videos that contain complex scenarios. We propose an adaptive particle filter tracking scheme with exquisite resampling (AERPF), which improves the prediction, importance sampling, and resampling stages. In the prediction stage, the search region and the number of particles are adapted to handle object disappearance and obstacle disturbance, so that the target is found more efficiently. In the importance sampling stage, optical flow supplies dynamic motion information that refines the particle weights, which improves the accuracy of the updated object location. In the resampling stage, the exquisite resampling (ER) algorithm is applied so that the particle set better reflects the posterior probability density function of the true state. The proposed method can be applied to tracking with both fixed and active cameras, and it handles both partial and full occlusion. In our experiments, it outperforms the original particle filter.
Keywords
1 Introduction
Video object tracking is an important topic within the field of computer vision. It has a wide range of applications, such as vehicle navigation and human-computer interaction. Various approaches to object tracking have been proposed. Reference [1] proposed a tracking method based on mean shift, which iteratively maximizes the similarity between the color histograms of the target and the candidate. Its advantages are that it avoids a brute-force search and has a low computational cost. Reference [2] extended the approach to a 3D domain, combining color and spatial information to handle orientation changes and small scale changes. Reference [3] used a stochastic meta-descent optimization method that can track fast-moving objects with significant scale change in low-frame-rate video.
Template matching is a common and direct tracking method. It finds the position of the object by minimizing the error with respect to a predefined object template. Reference [4] used the previous frame to adapt the object template, which addresses appearance changes during movement. Reference [5] proposed a template updating algorithm that avoids the drift inherent in the naive algorithm.
The particle filter is based on Monte Carlo methods. It estimates the state from its posterior probability and is commonly used in pattern recognition and object tracking, e.g., [6]. Building on [7], this paper proposes an adaptive particle filter tracking scheme with exquisite resampling (AERPF), which is described in the following section.
2 Adaptive Exquisite Resampling Particle Filter
In this section, we describe the proposed algorithm in detail. Figure 1 shows the flow chart of AERPF with its three basic stages: prediction, importance sampling, and resampling.
2.1 Particle Filter
As an extension of the Kalman filter idea, the particle filter can be applied to both linear and nonlinear problems. Suppose we have a system described by the equations:
The location of the object being tracked is a state vector x, which cannot be observed directly. We use a dynamic model of the state to predict how it evolves over time. The vector y represents the features observed at location x. We use this observation to correct the estimate of the state.
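For concreteness, a generic discrete-time state-space form consistent with this description can be written as follows. The symbols \( f \), \( h \), \( v_k \), and \( w_k \) are assumptions chosen for illustration (a common convention in the particle filter literature, e.g., [6]), not necessarily the authors' notation.

```latex
\begin{aligned}
x_k &= f(x_{k-1}, v_k)  && \text{(dynamic model, process noise } v_k\text{)} \\
y_k &= h(x_k, w_k)      && \text{(observation model, measurement noise } w_k\text{)} \\
p(x_k \mid y_{1:k}) &\approx \sum_{i=1}^{N} w_k^{(i)} \, \delta\!\left(x_k - x_k^{(i)}\right)
                        && \text{(particle approximation of the posterior)}
\end{aligned}
```

Here \( x_k^{(i)} \) are the particles and \( w_k^{(i)} \) are their normalized weights, so the three stages of the filter amount to propagating the particles through \( f \), reweighting them with the observation, and resampling.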
2.2 Prediction
The first stage of the particle filter is prediction. When the object disappears, instead of spreading particles randomly, we spread them radially from the location where the object disappeared, on the assumption that the object will not move far away immediately. If the object is only temporarily occluded, this re-acquires the target more efficiently than a global search. Under long-term occlusion, the particles have by then been spread globally, which avoids missing the object.
Next, we use the motion vectors obtained from optical flow to adjust the diffusion range. A high standard deviation of the motion vectors indicates that the object is moving erratically, so we enlarge the diffusion range, as in Fig. 2(a). A low standard deviation indicates consistent motion, so the diffusion range can be shrunk, as in Fig. 2(b).
In addition to the diffusion range, the motion vectors also let us predict the moving direction. It is reasonable to assume that the object keeps moving in the direction it followed over the last few seconds; therefore, if the moving direction is consistent, we spread the particles in that direction.
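The adaptive prediction described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `predict_particles`, the parameters `base_sigma`, `gain`, and `consistency_threshold`, the linear mapping from motion-vector spread to diffusion range, and the use of the mean resultant length as a direction-consistency measure are all hypothetical choices.

```python
import numpy as np


def predict_particles(particles, motion_vectors, rng,
                      base_sigma=4.0, gain=1.0, consistency_threshold=0.8):
    """Diffuse particles with a range and drift adapted to recent motion.

    particles      : (N, 2) array of particle positions in pixels.
    motion_vectors : (M, 2) recent optical-flow vectors of the object.
    All tuning parameters are hypothetical, chosen only for illustration.
    """
    particles = np.asarray(particles, dtype=float)
    mv = np.asarray(motion_vectors, dtype=float)

    # Diffusion range: grows with the spread of the motion vectors.
    spread = mv.std(axis=0).mean()
    sigma = base_sigma + gain * spread

    # Direction consistency: length of the mean unit vector (1 = same direction).
    norms = np.linalg.norm(mv, axis=1)
    moving = norms > 1e-9
    if moving.any():
        units = mv[moving] / norms[moving, None]
        consistency = np.linalg.norm(units.mean(axis=0))
    else:
        consistency = 0.0

    # Drift the particles along the mean motion only if the direction is consistent.
    drift = mv.mean(axis=0) if consistency >= consistency_threshold else np.zeros(2)

    noise = rng.normal(0.0, sigma, size=particles.shape)
    return particles + drift + noise, sigma, drift
```

With steady motion vectors the sketch keeps the diffusion range at `base_sigma` and shifts the particles along the mean motion; with erratic vectors it widens the range and applies no drift.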
2.3 Importance Sampling
The second stage of the particle filter is importance sampling. The objective at this stage is to assign each particle a weight, so that the more important particles are preserved and the less important ones are eliminated according to their weights.
The color histogram of the target model is used as the feature that determines the weights. The histograms are calculated in RGB space using 8 × 8 × 8 bins. We establish a target model \( q = \{ q^{(u)} \} \) with \( \sum_{u = 1}^{N} q^{(u)} = 1 \) and a candidate model \( p(x) = \{ p^{(u)}(x) \} \) with \( \sum_{u = 1}^{N} p^{(u)}(x) = 1 \). Because boundary pixels might belong to the background or become occluded easily, the center of the object is considered more important than the boundary. Hence a kernel function is used to assign smaller weights to pixels farther from the region center:
where x is the distance of the pixel location from the region center.
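The kernel-weighted histogram can be sketched as follows. The kernel profile used here, \( k(r) = 1 - r^2 \) for \( r < 1 \) and 0 otherwise, is an assumption (a profile commonly used in color-based tracking), as is the normalization of the distance by the half-diagonal of the patch; the function names are hypothetical.

```python
import numpy as np


def kernel(r):
    """Assumed kernel profile: 1 - r^2 inside the unit radius, 0 outside."""
    r = np.asarray(r, dtype=float)
    return np.where(r < 1.0, 1.0 - r ** 2, 0.0)


def weighted_histogram(patch, bins=8):
    """Kernel-weighted RGB histogram of an (H, W, 3) uint8 patch.

    Returns a normalized vector of length bins**3 (512 for 8 x 8 x 8 bins).
    """
    h, w, _ = patch.shape
    ys, xs = np.mgrid[0:h, 0:w]
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0

    # Normalize distances so that the patch corners lie at r = 1 (assumption).
    half_diag = np.hypot(cy, cx) or 1.0
    r = np.hypot(ys - cy, xs - cx) / half_diag
    k = kernel(r)

    # Quantize each channel into `bins` levels and form a single bin index.
    idx = (patch.astype(np.int64) * bins) // 256
    flat = (idx[..., 0] * bins + idx[..., 1]) * bins + idx[..., 2]

    hist = np.bincount(flat.ravel(), weights=k.ravel(), minlength=bins ** 3)
    total = hist.sum()
    return hist / total if total > 0 else hist
```

Pixels near the center contribute almost their full count to their color bin, while pixels at the corners contribute nothing, which matches the stated intent of down-weighting the boundary.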
After obtaining the initial weights from the Bhattacharyya coefficients, we refine them in two steps. Optical flow [8] is the apparent motion of brightness patterns in the image; ideally, it is the same as the motion field. By averaging the motion vectors obtained from optical flow (4), we predict a new center from the previous center. The first step is to promote the weights of the particles around the center that optical flow predicts.
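A minimal sketch of the initial weighting and the first refinement step follows. The Bhattacharyya coefficient itself is standard; everything else is an assumption made for illustration: the helper name `promote_near_predicted_center`, the circular neighborhood of radius `radius`, and the multiplicative factor `boost` are hypothetical, since the text does not specify how the weights are promoted.

```python
import numpy as np


def bhattacharyya(p, q):
    """Bhattacharyya coefficient between two normalized histograms."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.sum(np.sqrt(p * q)))


def promote_near_predicted_center(weights, positions, last_center,
                                  motion_vectors, radius=10.0, boost=2.0):
    """Boost particles close to the center predicted by the mean motion vector.

    `radius` and `boost` are hypothetical tuning parameters.
    """
    w = np.asarray(weights, dtype=float)
    positions = np.asarray(positions, dtype=float)

    # Predicted center = previous center + average optical-flow vector.
    mean_motion = np.mean(np.asarray(motion_vectors, dtype=float), axis=0)
    predicted_center = np.asarray(last_center, dtype=float) + mean_motion

    # Promote the weights of particles within `radius` of the predicted center.
    dist = np.linalg.norm(positions - predicted_center, axis=1)
    promoted = np.where(dist <= radius, boost * w, w)

    total = promoted.sum()
    if total > 0:
        promoted = promoted / total
    return promoted, predicted_center
```

For example, two particles with equal initial weights, one at the predicted center and one far away, end up with weights of 2/3 and 1/3 when `boost=2.0`.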
The second step is to set a threshold. Because low-weight particles decrease accuracy, we eliminate the less important ones. A suitable measure of the degeneracy of the algorithm is the effective sample size \( N_{\mathrm{eff}} \) introduced in [6]. Using (5) to obtain the effective samples, we choose the lowest weight among those samples as the threshold. Any weight lower than the threshold is set to zero (6).
When all the weights are small and are set to zero, no particle in the frame is similar to the target; in other words, the object is not present.
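The thresholding step can be sketched as follows, using the standard estimate \( N_{\mathrm{eff}} = 1 / \sum_i w_i^2 \) for normalized weights from [6]. The reading of "effective samples" as the round(\( N_{\mathrm{eff}} \)) highest-weight particles is an assumption, and the function name `threshold_weights` is hypothetical.

```python
import numpy as np


def threshold_weights(weights):
    """Zero out weights below the lowest weight among the effective samples.

    Assumption: the effective samples are the round(N_eff) particles with
    the highest weights.
    """
    w = np.asarray(weights, dtype=float)
    total = w.sum()
    if total <= 0:
        # No particle resembles the target at all.
        return np.zeros_like(w), 0.0

    w = w / total                      # N_eff is defined for normalized weights
    n_eff = 1.0 / np.sum(w ** 2)

    # Number of effective samples, clamped to [1, N].
    k = max(1, min(len(w), int(round(n_eff))))

    # Threshold = lowest weight among the k highest-weight particles.
    threshold = np.sort(w)[::-1][k - 1]
    kept = np.where(w >= threshold, w, 0.0)
    return kept, n_eff
```

With weights `[0.6, 0.3, 0.07, 0.03]`, the estimate is about 2.19 effective samples, so only the two largest weights survive and the other two are set to zero.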
2.4 Resampling
The goal of this stage is to eliminate particles with small weights and to concentrate on particles with large weights for the next prediction stage. However, the original resampling algorithm has a defect, which Fig. 3 illustrates.
N is the total number of particles, \( \{ C_{i} \}_{i = 1}^{N} \) is the cumulative sum of the weights, and \( \{ U_{i} \}_{i = 1}^{N} \) is a sequence of random variables uniformly distributed in the interval [0, 1]. We view \( U_i \) as a threshold; a particle whose CDF value crosses it is considered more important. As shown in Fig. 3, because \( C_2 \), \( C_4 \), and \( C_5 \) cross the thresholds \( U_2 \), \( U_4 \), and \( U_5 \), particles 2, 4, and 5 are retained for the next prediction stage. However, the weight of particle 2 is actually smaller than that of particle 1, and the weight of particle 4 is smaller than that of particle 3. This defect decreases the estimation accuracy.
We adopt exquisite resampling [9] to overcome this defect. First, the same procedure is carried out until a particle, here particle 2, crosses the threshold. Then we go back, check all the particles in that interval, and preserve the one with the highest weight. Thus particles 1 and 3 are preserved instead of particles 2 and 4, and the resampled set reflects the posterior pdf of the true state more accurately.
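The difference between the two selection rules can be sketched as follows. This is one reading of the description above, not the algorithm of [9] itself: the function names are hypothetical, the thresholds are assumed to be sorted in ascending order, and the "interval" is taken to be the particles lying between the previous crossing and the current one.

```python
import numpy as np


def crossing_indices(weights, thresholds):
    """Baseline: for each sorted threshold, the first particle whose CDF reaches it."""
    w = np.asarray(weights, dtype=float)
    c = np.cumsum(w / w.sum())
    c[-1] = 1.0  # guard against round-off in the last CDF value
    return np.searchsorted(c, np.asarray(thresholds, dtype=float), side="left")


def interval_max_indices(weights, thresholds):
    """Variant: keep the highest-weight particle in the interval ending at each crossing."""
    w = np.asarray(weights, dtype=float)
    crossing = crossing_indices(w, thresholds)
    chosen = np.empty_like(crossing)

    prev_cross, prev_choice = -1, -1
    for j, i in enumerate(crossing):
        if i == prev_cross:
            # Same interval as the previous threshold: reuse its choice.
            chosen[j] = prev_choice
        else:
            # Look back over the particles since the previous crossing.
            lo = prev_cross + 1
            chosen[j] = lo + int(np.argmax(w[lo:i + 1]))
        prev_cross, prev_choice = i, chosen[j]
    return chosen
```

With weights `[0.25, 0.15, 0.25, 0.15, 0.2]` and thresholds `[0.3, 0.35, 0.7, 0.75, 0.9]`, the baseline returns the zero-based indices `[1, 1, 3, 3, 4]` (particles 2, 4, and 5), whereas the variant returns `[0, 0, 2, 2, 4]` (particles 1, 3, and 5), which mirrors the situation described for Fig. 3.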
To demonstrate the improvement, we use the following nonlinear model as an example. The dynamic state-space model is given by the following equations:
where \( v_k \) and \( w_k \) are nonzero mean Gaussian random variables, \( x_0 = 1 \), α = 0.5, β = 25, γ = 8, the number of samples is 100, and the time step is 50 s.
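A test model with these parameters can be simulated as sketched here. The functional form is an assumption: it is the univariate nonstationary growth model widely used as a particle filter benchmark (see, e.g., [6]), with the stated α, β, and γ as its coefficients. The measurement equation \( y_k = x_k^2 / 20 + w_k \), the noise means and standard deviations, and the interpretation of 50 as the number of time steps are likewise assumptions, since the text does not give them.

```python
import numpy as np


def simulate_growth_model(steps=50, x0=1.0, alpha=0.5, beta=25.0, gamma=8.0,
                          process_mean=0.0, process_std=1.0,
                          measurement_mean=0.0, measurement_std=1.0, seed=0):
    """Simulate an assumed benchmark model:

        x_k = alpha*x_{k-1} + beta*x_{k-1}/(1 + x_{k-1}^2) + gamma*cos(1.2 k) + v_k
        y_k = x_k^2 / 20 + w_k

    The noise parameters are hypothetical placeholders.
    """
    rng = np.random.default_rng(seed)
    x = np.empty(steps + 1)
    y = np.empty(steps + 1)

    x[0] = x0
    y[0] = x0 ** 2 / 20.0 + rng.normal(measurement_mean, measurement_std)

    for k in range(1, steps + 1):
        v = rng.normal(process_mean, process_std)        # process noise v_k
        w = rng.normal(measurement_mean, measurement_std)  # measurement noise w_k
        x[k] = (alpha * x[k - 1]
                + beta * x[k - 1] / (1.0 + x[k - 1] ** 2)
                + gamma * np.cos(1.2 * k)
                + v)
        y[k] = x[k] ** 2 / 20.0 + w
    return x, y
```

Running a particle filter with 100 particles on the returned `y` and comparing its estimates against `x` gives the kind of error measurement that the resampling comparison relies on.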
3 Experimental Results
Table 1 shows the comparison of AERPF with the original PF. Experiments 1 to 3 use sequences from a fixed camera, and experiments 4 to 6 use sequences from an active camera. The tracking results show improvements in accuracy and F-score. At the same time, AERPF decreases the mean error, which means that it enhances recognition of the target object.
References
1. Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Trans. Pattern Anal. Mach. Intell. 25(5), 564–577 (2003)
2. Zhao, Q., Hai, T.: Object tracking using color correlogram. In: The 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (2005)
3. Li, Z., Chen, J., Schraudolph, N.N.: An improved mean-shift tracker with kernel prediction and scale optimisation targeting for low-frame-rate video tracking. In: The 19th International Conference on Pattern Recognition, pp. 1–4 (2008)
4. Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: The 7th International Joint Conference on Artificial Intelligence, vol. 81, pp. 674–679 (1981)
5. Matthews, I., Ishikawa, T., Baker, S.: The template update problem. IEEE Trans. Pattern Anal. Mach. Intell. 26(6), 810–815 (2004)
6. Arulampalam, M.S.: A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 50(2), 174–188 (2002)
7. Vermaak, J., Godsill, S.J., Perez, P.: Monte Carlo filtering for multi target tracking and data association. IEEE Trans. Aerosp. Electron. Syst. 41(1), 309–332 (2005)
8. Horn, B.K.P., Schunck, B.G.: Determining optical flow. Artif. Intell. 17(1), 185–203 (1981)
9. Fu, X., Jia, Y.: An improvement on resampling algorithm of particle filters. IEEE Trans. Signal Process. 58(10), 5414–5420 (2010)
© 2015 Springer International Publishing Switzerland
Dung, LR., Huang, YC., Huang, RY., Wu, YY. (2015). An Adaptive Particle Filtering for Solving Occlusion Problems of Video Tracking. In: Stephanidis, C. (eds) HCI International 2015 - Posters’ Extended Abstracts. HCI 2015. Communications in Computer and Information Science, vol 528. Springer, Cham. https://doi.org/10.1007/978-3-319-21380-4_114
DOI: https://doi.org/10.1007/978-3-319-21380-4_114
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21379-8
Online ISBN: 978-3-319-21380-4