Online pedestrian tracking via saliency-based H-S histogram

Object tracking is one of the most important topics in computer vision. While the state-of-the-art tracking algorithms achieved great success, there are still some challenging problems to be solved. Firstly, it remains a tough task to develop a tracking algorithm with both accuracy and efficiency. Secondly, the ground truth is often given by a rectangular bounding box, which contains not only the target, but also the background pixels. The background pixels in the initial bounding box will mislead the appearance model of the target. Thirdly, most features for representing the target only use the gray scale information, and are not robust to pedestrians. The third problem is especially serious when the targets are pedestrians, which have colorful cloths on them, and change their pose and shape while walking. In this paper, we propose a novel saliency-based H-S histogram algorithm. Saliency detection can efficiently delete most of the background pixels, which makes the appearance model more accurate. H-S histogram is a statistic-based feature, which is more robust to shape deformation, and the color information is considered in the H-S channels. Experiments on Caltech pedestrian database show the proposed method can handle some hard cases, and achieves a higher success rate.

[1]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Hubert Konik,et al.  Full Reference Image Quality Assessment Based on Saliency Map Analysis , 2010 .

[3]  Huchuan Lu,et al.  Saliency Detection via Absorbing Markov Chain , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[5]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Huchuan Lu,et al.  Robust object tracking via sparsity-based collaborative model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Touradj Ebrahimi,et al.  The JPEG2000 still image coding system: an overview , 2000, IEEE Trans. Consumer Electron..

[8]  Jiri Matas,et al.  P-N learning: Bootstrapping binary classifiers by structural constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[10]  David W. Jacobs,et al.  In search of illumination invariants , 2001, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[11]  Hubert Konik,et al.  A Spatiotemporal Saliency Model for Video Surveillance , 2011, Cognitive Computation.

[12]  Nanning Zheng,et al.  Automatic salient object extraction with contextual cue , 2011, 2011 International Conference on Computer Vision.

[13]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[14]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.

[15]  Hubert Konik,et al.  Predictive Saliency Maps for Surveillance Videos , 2010, 2010 Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science.

[16]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[17]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Fatih Murat Porikli,et al.  Integral histogram: a fast way to extract histograms in Cartesian spaces , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).