A joint appearance model of SRC and MFH for multi-objects tracking

Abstract This paper extends sparse representation based classification (SRC) and multi-feature hashing (MFH) into multi-object tracking task, and proposes a joint appearance model of SRC and MFH, which aims at discriminating different objects effectively. Unlike most previous approaches which only focus on producing appearance models for all targets, we further consider discriminative features for distinguishing difficult pairs of targets. Firstly, an SRC based global discriminative appearance model is designed for discriminating all targets. It formulates tracklets association as an SRC problem. A discriminative dictionary learning approach is introduced, which improves the SRC classification performance. By this way, the global discriminative appearance model can distinguish different targets more effectively. Secondly, an MFH based pairwise appearance model is designed. This pairwise appearance model focuses on distinguishable features from two targets without considering other targets or backgrounds, therefore it is more effective for differentiating specific close-by tracklets pairs. Data association framework is employed to generate final tracks. Considerable performance improvements are shown on challenging data sets, particularly in metrics of identity switches.

[1]  Ramakant Nevatia,et al.  Multi-target tracking by online learning of non-linear motion patterns and robust appearance models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Yang Yang,et al.  Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes , 2014, ACM Multimedia.

[3]  Kuk-Jin Yoon,et al.  Robust Online Multi-object Tracking Based on Tracklet Confidence and Online Discriminative Appearance Learning , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[5]  Luc Van Gool,et al.  Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Ramakant Nevatia,et al.  An online learned CRF model for multi-target tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yue Gao,et al.  Attribute-Augmented Semantic Hierarchy: Towards a Unified Framework for Content-Based Image Retrieval , 2014, TOMM.

[8]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[9]  Lei Zhang,et al.  Metaface learning for sparse representation based face recognition , 2010, 2010 IEEE International Conference on Image Processing.

[10]  Alexandre Heili,et al.  Exploiting Long-Term Connectivity and Visual Motion in CRF-Based Multi-Person Tracking , 2014, IEEE Transactions on Image Processing.

[11]  Ramakant Nevatia,et al.  How does person identity recognition help multi-person tracking? , 2011, CVPR 2011.

[12]  William T. Freeman,et al.  What makes a good model of natural images? , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Xiaojing Chen,et al.  An Online Learned Elementary Grouping Model for Multi-target Tracking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Konrad Schindler,et al.  Multi-target tracking by continuous energy minimization , 2011, CVPR 2011.

[15]  Fumin Shen,et al.  Inductive Hashing on Manifolds , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Ram Nevatia,et al.  Learning to associate: HybridBoosted multi-target tracker for crowded scene , 2009, CVPR.

[17]  Bohyung Han,et al.  Learning occlusion with likelihoods for visual tracking , 2011, 2011 International Conference on Computer Vision.

[18]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[19]  David Zhang,et al.  Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[20]  Tat-Seng Chua,et al.  Discrete Image Hashing Using Large Weakly Annotated Photo Collections , 2016, AAAI.

[21]  Zi Huang,et al.  Local image tagging via graph regularized joint group sparsity , 2013, Pattern Recognit..

[22]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[23]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Ramakant Nevatia,et al.  High performance object detection by collaborative learning of Joint Ranking of Granules features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Ke Huang,et al.  Sparse Representation for Signal Classification , 2006, NIPS.

[26]  Ramakant Nevatia,et al.  Multi-Target Tracking by Online Learning a CRF Model of Appearance and Motion Patterns , 2013, International Journal of Computer Vision.

[27]  Anton van den Hengel,et al.  Learning Compact Binary Codes for Visual Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Ramakant Nevatia,et al.  Robust Object Tracking by Hierarchical Association of Detection Responses , 2008, ECCV.

[29]  Kidiyo Kpalma,et al.  Multi-object tracking using sparse representation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[30]  Heng Tao Shen,et al.  Hashing on Nonlinear Manifolds , 2014, IEEE Transactions on Image Processing.

[31]  Huchuan Lu,et al.  Discriminative Hash Tracking With Group Sparsity , 2016, IEEE Transactions on Cybernetics.

[32]  Ramakant Nevatia,et al.  Multi-target tracking by on-line learned discriminative appearance models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[33]  Zi Huang,et al.  Tag localization with spatial correlations and joint group sparsity , 2011, CVPR 2011.

[34]  Afshin Dehghan,et al.  GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs , 2012, ECCV.

[35]  Chuancai Liu,et al.  Two dimensional hashing for visual tracking , 2015, Comput. Vis. Image Underst..

[36]  Shihong Lao,et al.  Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses , 2009, CVPR.

[37]  Andrea Cavallaro,et al.  Multi-target tracking on confidence maps: An application to people tracking , 2013, Comput. Vis. Image Underst..

[38]  Pascal Fua,et al.  Tracking multiple people under global appearance constraints , 2011, 2011 International Conference on Computer Vision.

[39]  Luc Van Gool,et al.  Robust tracking-by-detection using a detector confidence particle filter , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[40]  GaoYue,et al.  Attribute-Augmented Semantic Hierarchy , 2014 .

[41]  CavallaroAndrea,et al.  Multi-target tracking on confidence maps , 2013 .