Learning salient features to prevent model drift for correlation tracking

Abstract Correlation Filter (CF) based algorithms play an important role in the field of Visual Object Tracking (VOT) due to their high accuracy and low computational complexity. While existing CF tracking algorithms suffer performance degradation due to inaccurate object modeling. In this paper, we improve the object modeling accuracy in both CF training stage and target detection procedure to preventing the drift problem. Specifically, we propose a multi-model structure for CF trackers to capture the target appearance changes, where different appearance models are trained with specific samples to catch the salient features of the target and reduce the computational cost. Furthermore, a space filter for detection features is designed to suppress the boundary effect under Gaussian motion prior, which contributes to improving the accuracy of position estimation. We deploy our method to three hand-crafted features based CF trackers to perform real-time visual tracking on popular benchmarks. The experimental results demonstrate the efficacy of our proposed scheme and the efficiency of our trackers. In addition, we provide a comprehensive analysis of the proposed method to facilitate application.

[1]  Cordelia Schmid,et al.  Applying Color Names to Image Description , 2007, 2007 IEEE International Conference on Image Processing.

[2]  Jin Wang,et al.  Complexity and Algorithms for Superposed Data Uploading Problem in Networks With Smart Devices , 2020, IEEE Internet of Things Journal.

[3]  Haiyan Liu,et al.  An improved linear kernel for complementary maximal strip recovery: Simpler and smaller , 2019, Theor. Comput. Sci..

[4]  Zheng Chang,et al.  Real-Time Robust Tracking for Motion Blur and Fast Motion via Correlation Filters , 2016, Sensors.

[5]  Yongdong Zhang,et al.  STAT: Spatial-Temporal Attention Mechanism for Video Captioning , 2020, IEEE Transactions on Multimedia.

[6]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Cordelia Schmid,et al.  Learning Color Names from Real-World Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Qingming Huang,et al.  Hedged Deep Tracking , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Horst Bischof,et al.  PROST: Parallel robust online simple tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Ming-Hsuan Yang,et al.  Long-term correlation tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Michael Felsberg,et al.  Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Simon Lucey,et al.  Learning Background-Aware Correlation Filters for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15]  Bernard Ghanem,et al.  Target Response Adaptation for Correlation Filter Tracking , 2016, ECCV.

[16]  Yongdong Zhang,et al.  Adaptive weighted imbalance learning with application to abnormal activity recognition , 2016, Neurocomputing.

[17]  Cordelia Schmid,et al.  Learning Color Names for Real-World Applications , 2009, IEEE Transactions on Image Processing.

[18]  Ji Wan,et al.  SOML: Sparse Online Metric Learning with Application to Image Retrieval , 2014, AAAI.

[19]  Qi Tian,et al.  Multi-cue Correlation Filters for Robust Visual Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Rui Yang,et al.  Learning multi-channel correlation filter bank for eye localization , 2016, Neurocomputing.

[21]  Sheng Zhong,et al.  Motion Correlation Discovery for Visual Tracking , 2018, IEEE Signal Processing Letters.

[22]  Changyun Wen,et al.  Real-Time Event-Triggered Object Tracking in the Presence of Model Drift and Occlusion , 2019, IEEE Transactions on Industrial Electronics.

[23]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[24]  Michael Felsberg,et al.  ECO: Efficient Convolution Operators for Tracking , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  J. Dinesh Peter,et al.  Perspective model-based visual tracking scheme for robust tracking of objects in complex environs , 2017, Multimedia Tools and Applications.

[26]  Ji Wan,et al.  Sparse Online Learning of Image Similarity , 2017, ACM Trans. Intell. Syst. Technol..

[27]  Haibin Ling,et al.  Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28]  Bernard Ghanem,et al.  A Benchmark and Simulator for UAV Tracking , 2016, ECCV.

[29]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[30]  Michael Felsberg,et al.  Learning Spatially Regularized Correlation Filters for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[31]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Michael Felsberg,et al.  Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.

[33]  Ming-Hsuan Yang,et al.  Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking , 2017, International Journal of Computer Vision.

[34]  Ming-Hsuan Yang,et al.  Object Tracking Benchmark , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Simon Lucey,et al.  Correlation filters with limited boundaries , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Feng Li,et al.  Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[38]  Luca Bertinetto,et al.  Staple: Complementary Learners for Real-Time Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Yang Li,et al.  Reliable Patch Trackers: Robust visual tracking by exploiting reliable patches , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Bernard Ghanem,et al.  Context-Aware Correlation Filter Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Jianke Zhu,et al.  A Scale Adaptive Kernel Correlation Filter Tracker with Feature Integration , 2014, ECCV Workshops.

[42]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[43]  Feng Liu,et al.  From sample selection to model update: A robust online visual tracking algorithm against drifting , 2016, Neurocomputing.

[44]  Shuangquan Wang,et al.  Unobtrusive Sensing Incremental Social Contexts Using Fuzzy Class Incremental Learning , 2015, 2015 IEEE International Conference on Data Mining.

[45]  David Zhang,et al.  Integrating Boundary and Center Correlation Filters for Visual Tracking with Aspect Ratio Variation , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[46]  Bruce A. Draper,et al.  Visual object tracking using adaptive correlation filters , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  Arnold W. M. Smeulders,et al.  Tracking for Half an Hour , 2017, ArXiv.

[48]  Yong Liu,et al.  Large Margin Object Tracking with Circulant Feature Maps , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Huchuan Lu,et al.  Visual Tracking via Adaptive Spatially-Regularized Correlation Filters , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Yulong Xu,et al.  Robust Scale Adaptive Kernel Correlation Filter Tracker With Hierarchical Convolutional Features , 2016, IEEE Signal Processing Letters.

[51]  Fan Li,et al.  Scene-Aware Adaptive Updating for Visual Tracking via Correlation Filters , 2017, Sensors.

[52]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Stan Sclaroff,et al.  MEEM: Robust Tracking via Multiple Experts Using Entropy Minimization , 2014, ECCV.

[54]  Yongdong Zhang,et al.  A Fast Uyghur Text Detector for Complex Background Images , 2018, IEEE Transactions on Multimedia.

[55]  Michael Felsberg,et al.  Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking , 2016, ECCV.

[56]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[57]  Dit-Yan Yeung,et al.  Understanding and Diagnosing Visual Tracking Systems , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[58]  Dit-Yan Yeung,et al.  Learning a Deep Compact Image Representation for Visual Tracking , 2013, NIPS.

[59]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Changsheng Xu,et al.  Multi-task Correlation Particle Filter for Robust Object Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  Yue Zhou,et al.  Online Detection of Concept Drift in Visual Tracking , 2014, ICONIP.

[62]  Jin Wang,et al.  Multimodel Framework for Indoor Localization Under Mobile Edge Computing Environment , 2019, IEEE Internet of Things Journal.

[63]  Guoying Zhao,et al.  Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions , 2019, IEEE Transactions on Multimedia.

[64]  Rama Chellappa,et al.  Integrated Motion Detection and Tracking for Visual Surveillance , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).