Pair normalized channel feature and statistics-based learning for high-performance pedestrian detection

High-performance pedestrian detection with good accuracy and fast speed is an important yet challenging task in computer vision. We design a novel feature named pair normalized channel feature (PNCF), which simultaneously combines and normalizes two channel features in image channels, achieving a highly discriminative power and computa- tional efficiency. PNCF applies to both gradient channels and color channels so that shape and appearance information are described and integrated in the same feature. To efficiently explore the formidably large PNCF feature space, we propose a statistics-based feature learning method to select a small number of potentially discriminative candidate features, which are fed into the boosting algorithm. In addition, channel compression and a hybrid pyramid are employed to speed up the multi- scale detection. Experiments illustrate the effectiveness of PNCF and its learning method. Our proposed detector outperforms the state-of-the-art on several benchmark datasets in both detection accuracy and efficiency. © 2012 Society of Photo-Optical Instrumentation Engineers (SPIE). (DOI: 10.1117/1.OE.51.7 .077206)

[1]  Yair Weiss,et al.  Learning object detection from a small number of examples: the importance of good features , 2004, CVPR 2004.

[2]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, CVPR.

[3]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[4]  Christoph H. Lampert,et al.  Efficient Subwindow Search: A Branch and Bound Framework for Object Localization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Larry S. Davis,et al.  Multiple instance fFeature for robust part-based object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Javier R. Movellan,et al.  Optimal scanning for faster object detection , 2009, CVPR.

[7]  Larry S. Davis,et al.  Multiple instance fFeature for robust part-based object detection , 2009, CVPR.

[8]  Rita Cucchiara,et al.  Multistage Particle Windows for Fast and Accurate Object Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[10]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Navneet Dalal,et al.  Finding People in Images and Videos , 2006 .

[12]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[13]  Shihong Lao,et al.  Adaptive Contour Features in oriented granular space for human detection and segmentation , 2009, CVPR.

[14]  Tat-Jen Cham,et al.  Fast training and selection of Haar features using statistics in boosting-based face detection , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[15]  Ramakant Nevatia,et al.  High performance object detection by collaborative learning of Joint Ranking of Granules features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Bernt Schiele,et al.  Multi-cue onboard pedestrian detection , 2009, CVPR.

[17]  R. Tibshirani,et al.  Additive Logistic Regression : a Statistical View ofBoostingJerome , 1998 .

[18]  Bernt Schiele,et al.  A Performance Evaluation of Single and Multi-feature People Detection , 2008, DAGM-Symposium.

[19]  Rita Cucchiara,et al.  Covariance descriptors on moving regions for human detection in very complex outdoor scenes , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[20]  Jian Yao,et al.  Fast human detection from videos using covariance features , 2008, ECCV 2008.

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Hironobu Fujiyoshi,et al.  Feature co-occurrence representation based on boosting for object detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[23]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Miley W. Merkhofer,et al.  An Evaluation of the State of the Art , 1993 .

[25]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Hyung Jeong Yang,et al.  Recursive Coarse-to-Fine Localization for Fast Object Detection , 2014 .

[28]  D. Hinkley On the ratio of two correlated normal random variables , 1969 .

[29]  Bernt Schiele,et al.  New features and insights for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[30]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[31]  Luc Van Gool,et al.  Depth and Appearance for Mobile Scene Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[32]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[33]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[34]  Luc Van Gool,et al.  Robust Multiperson Tracking from a Mobile Platform , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.