Target-Distractor Aware Deep Tracking With Discriminative Enhancement Learning Loss

Numerous tracking approaches attempt to improve target representation through target-aware or distractor-aware. However, the unbalanced considerations of target or distractor information make it diffcult for these methods to benefit from the two aspects at the same time. In this paper, we propose a target-distractor aware model with discriminative enhancement learning loss to learn target representation, which can better distinguish the target in complex scenes. Firstly, to enlarge the gap between the target and distractor, we design a discriminative enhancement learning loss. By highlighting the hard negatives that are similar to the target and shrinking the easy negatives that are pure background, the features sensitive to the target or distractor representation can be more conveniently mined. On this basis, we further propose a target-distractor aware model. Unlike existing methods of preference target or distractor, we construct the target-specific feature space by activating the target-sensitive and the distractor-silence feature. Therefore, the appearance model can not only represent the target well but also suppress the background distractor. Finally, the target-distractor aware target representation model is integrated with a Siamese matching network for visual tracking for achieving robust and realtime visual tracking. Extensive experiments are performed on eight tracking benchmarks show that the proposed algorithm achieves favorable performance.

[1]  Zikun Zhou,et al.  Noise-Suppressing Deep Tracking , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Shengping Zhang,et al.  Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Yonghong Tian,et al.  Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Houqiang Li,et al.  Cascaded Regression Tracking: Towards Online Hard Distractor Discrimination , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Xin Zhao,et al.  GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Huanlong Zhang,et al.  Light regression memory and multi-perspective object special proposals for abrupt motion tracking , 2021, Knowl. Based Syst..

[7]  Junhui Hou,et al.  Correlation Filter Tracking via Distractor-Aware Learning and Multi-Anchor Detection , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Zhenyu He,et al.  Dual-regression model for visual tracking , 2020, Neural Networks.

[9]  Wei Zhao,et al.  Visual Tracking by Structurally Optimizing Pre-Trained CNN , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Zhipeng Zhang,et al.  Ocean: Object-aware Anchor-free Tracking , 2020, ECCV.

[11]  Jiebo Luo,et al.  Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Seyed Mojtaba Marvasti-Zadeh,et al.  Beyond Background-Aware Correlation Filters: Adaptive Context Modeling by Hand-Crafted and Deep RGB Features for Visual Tracking , 2020, ArXiv.

[13]  Pan Wang,et al.  Adaptive Discriminative Deep Correlation Filter for Visual Object Tracking , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Ying Cui,et al.  SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jian-Zeng Li,et al.  Distractor-Aware Long-Term Correlation Tracking Based on Information Entropy Weighted Feature , 2020, IEEE Access.

[17]  Dongdong Li,et al.  Target-Aware Correlation Filter Tracking in RGBD Videos , 2019, IEEE Sensors Journal.

[18]  Filiz Gurkan,et al.  Target Aware Visual Object Tracking , 2019, ICIAR.

[19]  Fumin Shen,et al.  Real-Time Deep Tracking via Corrective Domain Adaptation , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Huchuan Lu,et al.  Visual Tracking via Adaptive Spatially-Regularized Correlation Filters , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Jun Li,et al.  Deep Alignment Network Based Multi-Person Tracking With Occlusion and Motion Reasoning , 2019, IEEE Transactions on Multimedia.

[22]  Jun Li,et al.  Hierarchical Tracking by Reinforcement Learning-Based Searching and Coarse-to-Fine Verifying , 2019, IEEE Transactions on Image Processing.

[23]  L. Gool,et al.  Learning Discriminative Model Prediction for Tracking , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Zhenyu He,et al.  Target-Aware Deep Tracking , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Tianzhu Zhang,et al.  Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation , 2019, IEEE Transactions on Multimedia.

[26]  Gongjian Wen,et al.  Learning target-aware correlation filters for visual tracking , 2019, J. Vis. Commun. Image Represent..

[27]  Ming Du,et al.  Distractor-Aware Deep Regression for Visual Tracking , 2019, Sensors.

[28]  Wei Wu,et al.  SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Michael Felsberg,et al.  ATOM: Accurate Tracking by Overlap Maximization , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Fan Yang,et al.  LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Junseok Kwon,et al.  Deep Meta Learning for Real-Time Target-Aware Visual Tracking , 2017, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[32]  Ming-Hsuan Yang,et al.  Robust Visual Tracking via Hierarchical Convolutional Features , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Bin Luo,et al.  Learning Target-aware Attention for Robust Tracking with Conditional Adversarial Network , 2019, BMVC.

[34]  Xiaoan Tang,et al.  Trajectory Smoothing Constraint and Hard Negative Mining for Distractor-Aware Regression Tracking , 2019, IEEE Access.

[35]  Bingbing Ni,et al.  Deep Regression Tracking with Shrinkage Loss , 2018, ECCV.

[36]  Wei Wu,et al.  High Performance Visual Tracking with Siamese Region Proposal Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37]  Rynson W. H. Lau,et al.  VITAL: VIsual Tracking via Adversarial Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Bernard Ghanem,et al.  TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild , 2018, ECCV.

[39]  Hongdong Li,et al.  Not All Negatives Are Equal: Learning to Track With Multiple Background Clusters , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  Wenbing Tao,et al.  Convolutional Regression for Visual Tracking , 2016, IEEE Transactions on Image Processing.

[41]  Michael Felsberg,et al.  The Visual Object Tracking VOT2017 Challenge Results , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[42]  Rynson W. H. Lau,et al.  CREST: Convolutional Residual Learning for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[43]  Jiri Matas,et al.  Discriminative Correlation Filter with Channel and Spatial Reliability , 2017, CVPR.

[44]  Bernard Ghanem,et al.  Context-Aware Correlation Filter Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Simon Lucey,et al.  Learning Background-Aware Correlation Filters for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[46]  Michael Felsberg,et al.  ECO: Efficient Convolution Operators for Tracking , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[48]  Eraldo Ribeiro,et al.  Object-aware tracking , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[49]  Bernard Ghanem,et al.  A Benchmark and Simulator for UAV Tracking , 2016, ECCV.

[50]  Ales Leonardis,et al.  Distractor-Supported Single Target Tracking in Extremely Cluttered Scenes , 2016, ECCV.

[51]  Luca Bertinetto,et al.  Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[52]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Changsheng Xu,et al.  Robust Visual Tracking via Exclusive Context Modeling , 2016, IEEE Transactions on Cybernetics.

[55]  Ming-Hsuan Yang,et al.  Hierarchical Convolutional Features for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[56]  Michael Felsberg,et al.  Learning Spatially Regularized Correlation Filters for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[57]  Michael Felsberg,et al.  Convolutional Features for Correlation Filter Based Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[58]  Erik Blasch,et al.  Encoding color information for visual tracking: Algorithms and benchmark , 2015, IEEE Transactions on Image Processing.

[59]  Thomas Mauthner,et al.  In defense of color-based model-free tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.