Scale and Appearance Variation Enhanced Siamese Network for Thermal Infrared Target Tracking

Abstract Thermal infrared target tracking has recently attracted much attention. Although various methods have been proposed, they are usually sensitive to the target appearance and scale variation which will lead to tracking deviation. Therefore, in this paper, we propose a novel scale and appearance variation enhanced siamese network to solve the above problem. First, we introduce the dilated convolution module into siamese network to enlarge the receptive field of network while maintaining the resolution of feature maps. Hence, the model adaptability to the target with different scales could be enhanced. In addition, we devise a target template library update strategy based on the tracking results of historical frames to reduce the dependence of the network on the initially given exemplar. In this way, the appearance change of targets could be better described so as to overcome the tracking deviation problem over time. Qualitative and quantitative results on PTB-TIR, LSOTB-TIR and VOT-TIR2015 datasets demonstrate that our proposed model is able to achieve superior thermal infrared target tracking results. Especially, when the target appearance and scale have significant variations, our model shows outstanding performance than other state-of-the-art methods.

[1]  Zhenyu He,et al.  The Thermal Infrared Visual Object Tracking VOT-TIR2016 Challenge Results , 2016, ECCV Workshops.

[2]  Michael Felsberg,et al.  Channel Coded Distribution Field Tracking for Thermal Infrared Imagery , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  Wei Wu,et al.  High Performance Visual Tracking with Siamese Region Proposal Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  Pong C. Yuen,et al.  Learning Modality-Consistency Feature Templates: A Robust RGB-Infrared Tracking System , 2019, IEEE Transactions on Industrial Electronics.

[5]  Zhenyu He,et al.  Deep convolutional neural networks for thermal infrared object tracking , 2017, Knowl. Based Syst..

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Luca Bertinetto,et al.  Staple: Complementary Learners for Real-Time Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Zhenyu He,et al.  Target-Aware Deep Tracking , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Rynson W. H. Lau,et al.  CREST: Convolutional Residual Learning for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[10]  Song Wang,et al.  Learning Dynamic Siamese Network for Visual Object Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11]  Bin Sun,et al.  Thermal infrared and visible sequences fusion tracking based on a hybrid tracking framework with adaptive weighting scheme , 2019, Infrared Physics & Technology.

[12]  Qiang Wang,et al.  Fast Online Object Tracking and Segmentation: A Unifying Approach , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Luca Bertinetto,et al.  End-to-End Representation Learning for Correlation Filter Based Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Zhenyu He,et al.  LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark , 2020, ACM Multimedia.

[15]  Qi Tian,et al.  Multi-cue Correlation Filters for Robust Visual Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Jian Zhang,et al.  Robust tracking algorithm for infrared target via correlation filter and particle filter , 2020 .

[17]  Liying Zheng,et al.  Thermal infrared pedestrian tracking using joint siamese network and exemplar prediction model , 2020, Pattern Recognit. Lett..

[18]  Michael Felsberg,et al.  Learning Spatially Regularized Correlation Filters for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[19]  Yang Liu,et al.  Robust thermal infrared object tracking with continuous correlation filters and adaptive feature fusion , 2019, Infrared Physics & Technology.

[20]  Michael Felsberg,et al.  Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.

[21]  Zhenyu He,et al.  Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking , 2019, IEEE Transactions on Multimedia.

[22]  Wei Liu,et al.  Unsupervised Deep Tracking , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Feng Li,et al.  Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Bo Du,et al.  MeMu: Metric correlation Siamese network and multi-class negative sampling for visual tracking , 2020, Pattern Recognit..

[25]  Han Wu,et al.  A Real-time Robust Approach for Tracking UAVs in Infrared Videos , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[26]  Wei Wu,et al.  SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Chong Luo,et al.  A Twofold Siamese Network for Real-Time Object Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Luca Bertinetto,et al.  Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[29]  C S Asha,et al.  Robust infrared target tracking using discriminative and generative approaches , 2017 .

[30]  Michael Felsberg,et al.  ECO: Efficient Convolution Operators for Tracking , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Simon Lucey,et al.  Learning Background-Aware Correlation Filters for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[32]  Qifeng Yu,et al.  Dense structural learning for infrared object tracking at 200+ Frames per Second , 2017, Pattern Recognit. Lett..

[33]  Huchuan Lu,et al.  Structured Siamese Network for Real-Time Visual Tracking , 2018, ECCV.

[34]  Zhenyu He,et al.  Hierarchical spatial-aware Siamese network for thermal infrared object tracking , 2017, Knowl. Based Syst..

[35]  Jianbing Shen,et al.  Triplet Loss in Siamese Network for Object Tracking , 2018, ECCV.

[36]  Fahad Shahbaz Khan,et al.  Synthetic Data Generation for End-to-End Thermal Infrared Tracking , 2018, IEEE Transactions on Image Processing.

[37]  Huseyin Seckin Demir,et al.  Part-Based Co-Difference Object Tracking Algorithm for Infrared Videos , 2018, ICIP.

[38]  Zhenyu He,et al.  PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark , 2018, IEEE Transactions on Multimedia.

[39]  Michael Felsberg,et al.  The Thermal Infrared Visual Object Tracking VOT-TIR2015 Challenge Results , 2015, ICCV Workshops.

[40]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.