Fast Structured Tracker with Improvedmotionmodel using robust Kalman filter

In this chapter, we discuss the problem of tracking objects without having to learn a model of the object’s appearance in advance (i.e., model-free tracking). Object tracking has many practical applications, which motivated continous developments in the field. A particularly successful tracking technique that has been recently proposed is the structured tracker called Struck. Struck uses support vector machines to learn the object’s appearance as well as to predict its location from one video frame to the next. However, tracking remains a challenging problem due to change of lighting conditions throughout the video, change of scale, rotation of the object of the interest, and occlusions. Here, we present a tracker that is based on Struck. Structured learning is a suitable framework for tracking as it calculates a discriminative function mapping the appearance information from one video frame onto the bounding box containing the tracking object in another video frame. We discuss a number of concepts that help keep limit computational complexity throughout tracking: budget for support vectors and fast computation of the intersection kernel. We then show how the use of the Robust Kalman filter can help smooth tracking results, and consequently make the tracker robust to false-negative detections. We show how the Robust Kalman Filter helps detect short-time occlusions by classifying them as outliers. We discuss the tracking evaluation using different evaluation protocols, and present experimental results that show that the structured tracker with the robust filter improves upon original Struck tracker regardless of the features or kernels used.

[1]  Arnold W. M. Smeulders,et al.  Fast occluded object tracking by a robust appearance filter , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Subhransu Maji,et al.  Efficient Classification for Additive Kernel SVMs , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[5]  Ales Leonardis,et al.  Visual Object Tracking Performance Measures Revisited , 2015, IEEE Transactions on Image Processing.

[6]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[7]  Koby Crammer,et al.  Multi-Class Pegasos on a Budget , 2010, ICML.

[8]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[9]  Antoine Bordes,et al.  Sequence Labelling SVMs Trained in One Pass , 2008, ECML/PKDD.

[10]  Peter Ruckdeschel,et al.  Optimally Robust Kalman Filtering , 2010 .

[11]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jason Weston,et al.  Solving multiclass support vector machines with LaRank , 2007, ICML '07.

[15]  Huchuan Lu,et al.  Robust object tracking via sparsity-based collaborative model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.