Kernel joint visual tracking and recognition based on structured sparse representation

Visual target tracking and recognition are closely interrelated and can help each other potentially and significantly. The optimal Bayes joint decision and estimation (JDE) model guarantees the general decision (recognition) and estimation (tracking) arriving at the global optimization. In this paper, a kernel joint tracking and recognition (KJTR) algorithm is proposed to apply JDE to the simultaneous visual tracking and recognition problem and use the iteration learning procedure to achieve optimization. Besides, the structured sparse representation (SSR) model shows great efficiency and robustness in exploiting both holistic and local information of the target appearance. We show that constructing the appearance model with SSR can improve the performance of the proposed algorithm. Then, the contribution of each test candidate is considered into the learning procedure by a kernel function. Furthermore, the new joint weights of KJTR provide flexibility with appearance changes and thus robustness to the dynamic scene. The experimental results demonstrate that the KJTR algorithm performs well in terms of accuracy and robustness.

[1]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, ICCV.

[2]  David J. Kriegman,et al.  Visual tracking and recognition using probabilistic appearance manifolds , 2005, Comput. Vis. Image Underst..

[3]  Yangsheng Xu,et al.  Hand tracking and pose recognition via depth and color information , 2012, 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[4]  Yonina C. Eldar,et al.  Block-sparsity: Coherence and efficient recovery , 2008, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Chao Wang,et al.  Face Tracking and Recognition via Incremental Local Sparse Representation , 2013, 2013 Seventh International Conference on Image and Graphics.

[6]  Stefanos Zafeiriou,et al.  Sparse representations of image gradient orientations for visual recognition and tracking , 2011, CVPR 2011 WORKSHOPS.

[7]  Stefanos Zafeiriou,et al.  Efficient Online Subspace Learning With an Indefinite Kernel for Visual Tracking and Recognition , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Rama Chellappa,et al.  Visual tracking and recognition using appearance-adaptive models in particle filters , 2004, IEEE Transactions on Image Processing.

[9]  James J. Little,et al.  Learning to Track and Identify Players from Broadcast Sports Videos , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  X. Rong Li,et al.  Optimal bayes joint decision and estimation , 2007, 2007 10th International Conference on Information Fusion.

[11]  Masaki Hayashi,et al.  Multiple players tracking and identification using group detection and player number recognition in sports video , 2013, IECON 2013 - 39th Annual Conference of the IEEE Industrial Electronics Society.

[12]  WuYing,et al.  What Are We Tracking , 2013 .

[13]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Narendra Ahuja,et al.  Robust visual tracking via multi-task sparse learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  C EldarYonina,et al.  Block-sparse signals , 2010 .

[16]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[17]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Alan F. Smeaton,et al.  Thermo-visual feature fusion for object tracking using multiple spatiogram trackers , 2007 .

[19]  Hong Sun,et al.  Cluster structured sparse representation for high resolution satellite image classification , 2012, 2012 IEEE 11th International Conference on Signal Processing.

[20]  Shaohua Kevin Zhou,et al.  Integrated Detection, Tracking and Recognition for IR Video-Based Vehicle Classification , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[21]  Nan Jiang,et al.  Learning Adaptive Metric for Robust Visual Tracking , 2011, IEEE Transactions on Image Processing.

[22]  Yonina C. Eldar,et al.  Robust Recovery of Signals From a Structured Union of Subspaces , 2008, IEEE Transactions on Information Theory.

[23]  Yu Liu,et al.  Recursive joint decision and estimation based on generalized Bayes risk , 2011, 14th International Conference on Information Fusion.

[24]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[25]  Haibin Ling,et al.  Robust Visual Tracking and Vehicle Classification via Sparse Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  David Zhang,et al.  Fast Visual Tracking via Dense Spatio-temporal Context Learning , 2014, ECCV.

[27]  Guoliang Fan,et al.  Joint view-identity manifold for target tracking and recognition , 2012, 2012 19th IEEE International Conference on Image Processing.

[28]  Roberto Prevete,et al.  A linear approach for sparse coding by a two-layer neural network , 2015, Neurocomputing.

[29]  Harish Bhaskar Integrated human target detection, identification and tracking for surveillance applications , 2012, 2012 6th IEEE International Conference Intelligent Systems.

[30]  Fuchun Sun,et al.  Spatial Neighborhood-Constrained Linear Coding for Visual Object Tracking , 2014, IEEE Transactions on Industrial Informatics.

[31]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[32]  Andrea Cavallaro,et al.  Correlation-based self-correcting tracking , 2015, Neurocomputing.

[33]  J KriegmanDavid,et al.  Eigenfaces vs. Fisherfaces , 1997 .

[34]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[35]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Yazhe Tang,et al.  Structured sparse representation appearance model for robust visual tracking , 2011, 2011 IEEE International Conference on Robotics and Automation.

[37]  Chunhua Shen,et al.  Real-time visual tracking using compressive sensing , 2011, CVPR 2011.

[38]  Z. M. Hefed Object tracking , 1999 .

[39]  Youfu Li,et al.  Robust Visual Tracking Using Flexible Structured Sparse Representation , 2014, IEEE Transactions on Industrial Informatics.

[40]  Vladimir Pavlovic,et al.  Face tracking and recognition with visual constraints in real-world videos , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Richard T. Howard,et al.  Orbital Express Advanced Video Guidance Sensor: Ground Testing, Flight Results and Comparisons , 2008 .

[43]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[44]  Ying Wu,et al.  What Are We Tracking: A Unified Approach of Tracking and Recognition , 2013, IEEE Transactions on Image Processing.

[45]  David J. Kriegman,et al.  Online learning of probabilistic appearance manifolds for video-based recognition and tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[46]  X. Rong Li,et al.  Joint tracking and classification of extended object using random matrix , 2013, Proceedings of the 16th International Conference on Information Fusion.

[47]  H. Rauhut Random Sampling of Sparse Trigonometric Polynomials , 2005, math/0512642.

[48]  Rabab K. Ward,et al.  Fast group sparse classification , 2009 .

[49]  Vinod Chandran,et al.  Random Gabor based templates for facial expression recognition in images with facial occlusion , 2014, Neurocomputing.

[50]  Junseok Kwon,et al.  Visual tracking decomposition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[51]  Weiguang Li,et al.  Structured Sparse Representation Visual Tracking Using Bayes Classifier , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[52]  Babak Hassibi,et al.  On the reconstruction of block-sparse signals with an optimal number of measurements , 2009, IEEE Trans. Signal Process..

[53]  Lei Zhang,et al.  Real-Time Object Tracking Via Online Discriminative Feature Selection , 2013, IEEE Transactions on Image Processing.

[54]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[55]  Yonina C. Eldar,et al.  Block-Sparse Signals: Uncertainty Relations and Efficient Recovery , 2009, IEEE Transactions on Signal Processing.

[56]  Fuchun Sun,et al.  Hierarchical Structured Sparse Representation for T–S Fuzzy Systems Identification , 2013, IEEE Transactions on Fuzzy Systems.