Improving pedestrian detection with selective gradient self-similarity feature

Gradient features play important roles for the problem of pedestrian detection, especially the histogram of oriented gradients (HOG) feature. To improve detection accuracy in terms of feature extraction, HOG has been combined with multiple kinds of low-level features. However, it is still possible to exploit further discriminative information from the classical HOG feature. Inspired by the symmetrical characteristic of pedestrian appearance, we present a novel feature of gradient self-similarity (GSS) in this work. GSS is computed from HOG, and is applied to capturing the patterns of pairwise similarities of local gradient patches. Furthermore, a supervised feature selection approach is employed to remove the non-informative pairs. As a result, the selective GSS (SGSS) feature is built on a concise subset of pair comparisons. The experimental results demonstrate that significant improvement is achieved by incorporating HOG with GSS/SGSS. In addition, considering that HOG is a prerequisite for GSS/SGSS, it is intuitional to develop a two-level cascade of classifiers for obtaining improved detection performance. Specifically, the first level is a linear SVM with the multiscale HOG features to efficiently remove easy negatives. At the second stage, the already computed HOG features are reused to produce the corresponding GSS/SGSS features, and then the combined features are used to discriminate true positives from candidate image regions. Although simple, this model is competitive with the state-of-the-art methods on the well-known datasets. HighlightsWe propose the selective gradient self-similarity (SGSS) feature.SGSS captures pairwise similarity patterns of local gradient distributions.SGSS is a mid-level feature on top of HOG and is complementary to HOG.Addition of SGSS gives a significant boost to pedestrian detection accuracy.The AdaBoost-based cascade with HOG-SGSS outperforms the linear SVM/HIKSVM.

[1]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[2]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[3]  Larry S. Davis,et al.  A Pose-Invariant Descriptor for Human Detection and Segmentation , 2008, ECCV.

[4]  Andrew Zisserman,et al.  Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Deva Ramanan,et al.  Histograms of Sparse Codes for Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[7]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[8]  Ramakant Nevatia,et al.  Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Tae-Kyun Kim,et al.  Fast Pedestrian Detection by Cascaded Random Forest with Dominant Orientation Templates , 2012, BMVC.

[11]  Subhransu Maji,et al.  Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Yann LeCun,et al.  Pedestrian Detection with Unsupervised Multi-stage Feature Learning , 2012, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[14]  Chunhua Shen,et al.  Pyramid Center-Symmetric Local Binary/Trinary Patterns for Effective Pedestrian Detection , 2010, ACCV.

[15]  Dariu Gavrila,et al.  A Multilevel Mixture-of-Experts Framework for Pedestrian Classification , 2011, IEEE Transactions on Image Processing.

[16]  Greg Mori,et al.  Detecting Pedestrians by Learning Shapelet Features , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Luc Van Gool,et al.  Depth and Appearance for Mobile Scene Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[18]  Jian Zhang,et al.  Fast Pedestrian Detection Using a Cascade of Boosted Covariance Features , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Wei Huang,et al.  Detection and tracking of multiple moving objects in video , 2007, VISAPP.

[20]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[21]  Xiaogang Wang,et al.  Modeling Mutual Visibility Relationship in Pedestrian Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[23]  Ivor W. Tsang,et al.  Learning Sparse SVM for Feature Selection on Very High Dimensional Datasets , 2010, ICML.

[24]  Larry S. Davis,et al.  Human detection using partial least squares analysis , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[25]  Chunhua Shen,et al.  Effective Pedestrian Detection Using Center-symmetric Local Binary/Trinary Patterns , 2010, ArXiv.

[26]  Bernt Schiele,et al.  New features and insights for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[30]  Pierre Vandergheynst,et al.  FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[32]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Jonathan Warrell,et al.  Proposal generation for object detection using cascaded ranking SVMs , 2011, CVPR 2011.

[34]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[35]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[36]  Xiaogang Wang,et al.  Multi-stage Contextual Deep Learning for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[37]  David A. McAllester,et al.  Cascade object detection with deformable part models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[38]  Joseph J. Lim,et al.  Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[40]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[41]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[42]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Pietro Perona,et al.  Pedestrian detection: A benchmark , 2009, CVPR.

[44]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[45]  Serge J. Belongie,et al.  Integral Channel Features - Addendum , 2009 .

[46]  Xiaogang Wang,et al.  A discriminative deep model for pedestrian detection with occlusion handling , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Bernt Schiele,et al.  Multi-cue onboard pedestrian detection , 2009, CVPR.