Joint spatiograms for multi-modality tracking with online update

Abstract Integrating multiple different yet complementary modalities has been proved to be an effective way for boosting tracking performance. Many previous works just consider the fusion of different features from identical image or identical features from images with different spectrum alone, which makes them be quite distinct from each other and be hard to be integrated naturally. In this study, we propose a unified tracking framework to integrate multiple different modalities via innovative use of spatiogram, where the spatiogram is formed by weighting each bin of histogram with the mean and covariance of the locations of the pixels that contribute to that bin. Specifically, each modal target and its candidate are first represented by second-order spatiogram, and their similarity is measured by the weighted Bhattacharayya coefficient. Next, an objective function is built by integrating all modal similarities, then a joint center-shift formula of the target is gained by performing Taylor expansion and gradient minimization on the objective function. Finally, the optimal target location is gained recursively by applying the mean shift procedure. Besides, a fast fuzzy logic system is designed to adaptively adjust the weight of each modality, and a model update scheme based on particle filter is developed to capture the appearance variations. Our framework allows the modality to be original gray of pixel or other extracted feature from single image or different spectral images, and provides the flexibility to arbitrarily add or remove modality. Experimental results on three challenging public datasets demonstrate clearly the robustness and effectiveness of the proposed method.

[1]  Xiaochun Cao,et al.  Fusing two-stream convolutional neural networks for RGB-T object tracking , 2017, Neurocomputing.

[2]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Rynson W. H. Lau,et al.  Visual Tracking via Locality Sensitive Histograms , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[5]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[6]  Luca Bertinetto,et al.  End-to-End Representation Learning for Correlation Filter Based Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Michael Felsberg,et al.  Discriminative Scale Space Tracking , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Antonio Torralba,et al.  Visualizing Object Detection Features , 2015, International Journal of Computer Vision.

[9]  Haibin Ling,et al.  Real time robust L1 tracker using accelerated proximal gradient approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Canlong Zhang,et al.  Spatiogram and Fuzzy Logic Based Multi-modality Fusion Tracking with Online Update , 2017, CCCV.

[11]  Gang Xiao,et al.  A new tracking approach for visible and infrared sequences based on tracking-before-fusion , 2014, International Journal of Dynamics and Control.

[12]  Fuchun Sun,et al.  Fusion tracking in color and infrared images using joint sparse representation , 2012, Science China Information Sciences.

[13]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[16]  Jin Tang,et al.  RGB-T Object Tracking: Benchmark and Baseline , 2018, Pattern Recognit..

[17]  Haibin Ling,et al.  Robust visual tracking using ℓ1 minimization , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Michael Felsberg,et al.  Adaptive Color Attributes for Real-Time Visual Tracking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Huchuan Lu,et al.  Multi-feature tracking via adaptive weights , 2016, Neurocomputing.

[20]  Pong C. Yuen,et al.  Multi-cue Visual Tracking Using Robust Feature-Level Fusion Based on Joint Sparse Representation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Peng Ning-song,et al.  Model update mechanism for mean-shift tracking , 2005 .

[22]  Yi Li,et al.  DeepTrack: Learning Discriminative Feature Representations Online for Robust Visual Tracking , 2015, IEEE Transactions on Image Processing.

[23]  Jin Tang,et al.  Grayscale-Thermal Object Tracking via Multitask Laplacian Sparse Representation , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24]  Zhen Cui,et al.  Recurrently Target-Attending Tracking , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[26]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Gang Xiao,et al.  A compressive tracking based on time-space Kalman fusion model , 2015, Science China Information Sciences.

[28]  Shuicheng Yan,et al.  NUS-PRO: A New Visual Tracking Challenge , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Anastasios Tefas,et al.  Visual Object Tracking Based on Local Steering Kernels and Color Histograms , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Jia Liu,et al.  Multi-feature Hashing Tracking , 2016, Pattern Recognit. Lett..

[31]  Hui Cheng,et al.  Learning Collaborative Sparse Representation for Grayscale-Thermal Tracking , 2016, IEEE Transactions on Image Processing.

[32]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33]  Michael Felsberg,et al.  Learning Spatially Regularized Correlation Filters for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Stanley T. Birchfield,et al.  Spatiograms versus histograms for region-based tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[35]  Canlong Zhang,et al.  Joint compressive representation for multi-feature tracking , 2018, Neurocomputing.