Robust visual tracking via self-similarity learning

Self-similarity is an attractive image property that has been successfully applied to object recognition due to its robustness to severe target appearance variations. However, less attention has been paid to explore self-similarity for visual tracking, mainly because it is difficult to learn self-similarity information between different features suitable for visual tracking. To address this issue, a simple, yet effective approach is presented to learn self-similarity information among the local features extracted from the different regions of the target. The target is first divided into some non-overlapping regions, in which each region is described by the histogram of gradient (HOG) features. Then, an explicit polynomial kernel feature map is constructed, which is capable of characterising the self-similarity information among all the local regions in the targets. Finally, based on the feature maps, a linear support vector machine (SVM) is learnt via an online dual coordinate descent method that offers fast convergence guarantee. Experiments on a large tracking benchmark dataset with 50 sequences demonstrate the superiority of the proposed method over state-of-the-art methods.

[1]  Jin Gao,et al.  Transfer Learning Based Visual Tracking with Gaussian Processes Regression , 2014, ECCV.

[2]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Huchuan Lu,et al.  Robust object tracking via sparsity-based collaborative model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Chia-Hua Ho,et al.  Recent Advances of Large-Scale Linear Classification , 2012, Proceedings of the IEEE.

[5]  Dit-Yan Yeung,et al.  Learning a Deep Compact Image Representation for Visual Tracking , 2013, NIPS.

[6]  Stan Sclaroff,et al.  MEEM: Robust Tracking via Multiple Experts Using Entropy Minimization , 2014, ECCV.

[7]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, ICCV.

[8]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Zhongfei Zhang,et al.  A survey of appearance models in visual object tracking , 2013, ACM Trans. Intell. Syst. Technol..

[11]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Dit-Yan Yeung,et al.  Understanding and Diagnosing Visual Tracking Systems , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[13]  Qingshan Liu,et al.  Robust Visual Tracking via Convolutional Networks Without Training , 2016, IEEE Transactions on Image Processing.

[14]  Michael Felsberg,et al.  Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.

[15]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Thomas Deselaers,et al.  Global and efficient self-similarity for object classification and detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.