Hierarchical convolutional features for visual tracking via two combined color spaces with SVM classifier

Abstract

Like the majority of state-of-the-art object trackers, the hierarchical convolutional features (HCF) tracker cannot recover from drift caused by several challenges, especially heavy occlusion, scale variation, and illumination variation. In this paper, we present a new method that handles these challenges robustly through two principal tasks. First, we infer the target location using multichannel correlation maps obtained by combining five correlation filters learned on convolutional features. To handle illumination variation and obtain richer features, we use an HSV energy condition to control the use of two color spaces, RGB and HSV. Second, we use histogram of oriented gradients (HOG) features to learn another correlation filter that estimates scale variation. Furthermore, we use an online-trained SVM classifier to re-detect the target in failure cases. Extensive experiments on a commonly used tracking benchmark dataset show that our tracker significantly improves on HCF and outperforms state-of-the-art methods.
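The location step described in the abstract relies on correlation filters. As a rough illustration only, the sketch below learns a single-channel linear correlation filter in the Fourier domain (a standard ridge-regression closed form) and reads the target shift from the peak of the response map. It is not the paper's method: the paper combines five filters on convolutional features from two color spaces, whereas this sketch uses one raw 2-D array, and the function names, the Gaussian label width `sigma`, and the regularizer `lam` are all assumptions made for the example.

```python
import numpy as np


def gaussian_label(shape, sigma=2.0):
    """Desired response: a 2-D Gaussian whose peak sits at index (0, 0)."""
    h, w = shape
    ys = np.arange(h) - h // 2
    xs = np.arange(w) - w // 2
    yy, xx = np.meshgrid(ys, xs, indexing="ij")
    g = np.exp(-(yy ** 2 + xx ** 2) / (2.0 * sigma ** 2))
    # Move the centered peak to the origin so zero shift maps to (0, 0).
    return np.fft.ifftshift(g)


def train_filter(x, y, lam=1e-4):
    """Closed-form ridge regression in the Fourier domain (assumed setup)."""
    X = np.fft.fft2(x)
    Y = np.fft.fft2(y)
    return Y * np.conj(X) / (X * np.conj(X) + lam)


def respond(W, z):
    """Correlation response map of the learned filter on a new patch z."""
    return np.real(np.fft.ifft2(W * np.fft.fft2(z)))


def locate(response):
    """Return the (dy, dx) shift of the response peak, wrapped to signed values."""
    h, w = response.shape
    dy, dx = np.unravel_index(np.argmax(response), response.shape)
    if dy > h // 2:
        dy -= h
    if dx > w // 2:
        dx -= w
    return int(dy), int(dx)


# Toy usage: a random patch stands in for a feature channel (assumption).
rng = np.random.default_rng(0)
patch = rng.standard_normal((64, 64))
W = train_filter(patch, gaussian_label(patch.shape))
shifted = np.roll(patch, shift=(3, -5), axis=(0, 1))  # simulated target motion
print(locate(respond(W, shifted)))
```

In a multichannel tracker, one such response map would typically be computed per feature layer and the maps combined before taking the peak; how the paper weights its five filters is not stated in the abstract, so that step is left out here.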
