Simultaneous Visual Recognition and Tracking Based on Joint Decision and Estimation

Visual target tracking and recognition have been increasingly important in video surveillance. Conventional works deal with tracking and recognition as separate steps, whereas tracking and recognition are closely interrelated and can help each other potentially and significantly. To tackle this problem, based on the joint decision and estimation (JDE) model which guarantees the general decision (recognition) and estimation (tracking) arriving at the global optimization, a simultaneous visual recognition and tracking method is provided. Besides, the structured sparse representation (SSR) model shows great efficiency and robustness in exploiting both holistic and local information of the target appearance. We show that constructing the appearance model with SSR can improve the performance of the proposed algorithm. Then, the contribution of each test candidate is considered into the learning procedure by a kernel function. Furthermore, the new joint weights of the kernel function provide flexibility with appearance changes and thus robustness to the dynamic scene. The experimental results demonstrate that the proposed method performs well in terms of accuracy and robustness.

[1]  Zhongliang Jing,et al.  Kernel joint visual tracking and recognition based on structured sparse representation , 2016, Neurocomputing.

[2]  Youfu Li,et al.  Robust Visual Tracking Using Flexible Structured Sparse Representation , 2014, IEEE Transactions on Industrial Informatics.

[3]  Yonina C. Eldar,et al.  Block-Sparse Signals: Uncertainty Relations and Efficient Recovery , 2009, IEEE Transactions on Signal Processing.

[4]  David J. Kriegman,et al.  Visual tracking and recognition using probabilistic appearance manifolds , 2005, Comput. Vis. Image Underst..

[5]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  James J. Little,et al.  Learning to Track and Identify Players from Broadcast Sports Videos , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Richard T. Howard,et al.  Orbital Express Advanced Video Guidance Sensor: Ground Testing, Flight Results and Comparisons , 2008 .

[8]  Alan F. Smeaton,et al.  Thermo-visual feature fusion for object tracking using multiple spatiogram trackers , 2007 .

[9]  Rama Chellappa,et al.  Visual tracking and recognition using appearance-adaptive models in particle filters , 2004, IEEE Transactions on Image Processing.

[10]  Vibhav Vineet,et al.  Struck: Structured Output Tracking with Kernels , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Lei Zhang,et al.  Real-Time Object Tracking Via Online Discriminative Feature Selection , 2013, IEEE Transactions on Image Processing.

[14]  Hong Sun,et al.  Cluster structured sparse representation for high resolution satellite image classification , 2012, 2012 IEEE 11th International Conference on Signal Processing.

[15]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[16]  David Zhang,et al.  Fast Visual Tracking via Dense Spatio-temporal Context Learning , 2014, ECCV.

[17]  D. Donoho,et al.  Atomic Decomposition by Basis Pursuit , 2001 .

[18]  Yu Liu,et al.  Recursive joint decision and estimation based on generalized Bayes risk , 2011, 14th International Conference on Information Fusion.

[19]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[20]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[21]  Stefanos Zafeiriou,et al.  Efficient Online Subspace Learning With an Indefinite Kernel for Visual Tracking and Recognition , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Chao Wang,et al.  Face Tracking and Recognition via Incremental Local Sparse Representation , 2013, 2013 Seventh International Conference on Image and Graphics.

[23]  Ying Wu,et al.  What Are We Tracking: A Unified Approach of Tracking and Recognition , 2013, IEEE Transactions on Image Processing.

[24]  David J. Kriegman,et al.  Online learning of probabilistic appearance manifolds for video-based recognition and tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25]  H. Rauhut Random Sampling of Sparse Trigonometric Polynomials , 2005, math/0512642.

[26]  Stefanos Zafeiriou,et al.  Sparse representations of image gradient orientations for visual recognition and tracking , 2011, CVPR 2011 WORKSHOPS.

[27]  Haibin Ling,et al.  Robust Visual Tracking and Vehicle Classification via Sparse Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Fuchun Sun,et al.  Hierarchical Structured Sparse Representation for T–S Fuzzy Systems Identification , 2013, IEEE Transactions on Fuzzy Systems.

[29]  Nan Jiang,et al.  Learning Adaptive Metric for Robust Visual Tracking , 2011, IEEE Transactions on Image Processing.

[30]  Guoliang Fan,et al.  Joint view-identity manifold for target tracking and recognition , 2012, 2012 19th IEEE International Conference on Image Processing.

[31]  Yangsheng Xu,et al.  Hand tracking and pose recognition via depth and color information , 2012, 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[32]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Narendra Ahuja,et al.  Robust visual tracking via multi-task sparse learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.