Target Tracking via Incorporating Multi-modal Features

The challenge of visual tracking is to develop a robust target’s appearance model, the core of which involves an appropriate selection and an effective assembly of a cluster of features. In this paper, we propose a novel model to adaptively choose reasonable combination of feature sets to represent the target by employing the multi-kernel ridge regression. This model will update the weights distributions of different kernel groups for feature sets automatically and the regularization parameter’s value of the kernel regression objective function as well. For traditional multi-kernel based algorithm would cost too much time on training model, we develop a very simple and efficient algorithm by adapting feature sets to circulant structure so as to make use of the Fast Fourier Transform (FFT). Thus our algorithm can provide more robust tracking while maintaining real-time effects. To the best of our knowledge, this is the first time the multiple kernel learning algorithms is applied to real-time visual tracking. We evaluate the proposed algorithm on the popular benchmark including 50 image sequences and compare it with 9 state-of-art methods. Implemented in Matlab, the experiment results show that the proposed tracker runs at 45.4 frames per second on an i3 machine and outperforms the state-of-the-art trackers on the benchmark with respect to accuracy. Particularly, the average precision of our algorithm achieves 76.7 % under OPE curve at 20px.

[1]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Huchuan Lu,et al.  Robust object tracking via sparsity-based collaborative model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Bin Shen,et al.  Online robust image alignment via iterative convex optimization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Bruce A. Draper,et al.  Visual object tracking using adaptive correlation filters , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Lei Zhang,et al.  Real-Time Compressive Tracking , 2012, ECCV.

[7]  M. Kloft,et al.  l p -Norm Multiple Kernel Learning , 2011 .

[8]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Horst Bischof,et al.  Real-Time Tracking via On-line Boosting , 2006, BMVC.

[11]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, ICCV.

[13]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[14]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.