Pedestrian detection in images by integrating heterogeneous detectors

Pedestrian Detection in still images is a key problem in computer vision. Traditional approaches design features for representing the holistic human body. Unfortunately, occlusions and articulations pose challenges and degrade their performances. Part-based representations have more potential to solve these problems. However, they tend to produce more false alarms than holistic approaches. This paper proposes a framework to integrate heterogeneous detectors (including holistic, part-based and face detectors) to boost pedestrian detection performance. Responses from heterogeneous detectors cast probability votes using Hough transform and considering geometric relationship of different detectors. Peaks of votes localize where pedestrians are. To avoid false alarms, cell models are learned in advance to evaluate local alignment and to reject wrong detections. Experiments on the INRIA dataset show that our framework provides a better performance than some state-of-the art methods.

[1]  Larry S. Davis,et al.  Bilattice-based Logical Reasoning for Human Detection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[5]  Bernt Schiele,et al.  A Performance Evaluation of Single and Multi-feature People Detection , 2008, DAGM-Symposium.

[6]  Pietro Perona,et al.  Pedestrian detection: A benchmark , 2009, CVPR.

[7]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Yuan Li,et al.  High-Performance Rotation Invariant Multiview Face Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Ramakant Nevatia,et al.  Optimizing discrimination-efficiency tradeoff in integrating heterogeneous local features for object detection , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14]  T. Truong,et al.  Simplified procedure for decoding reed-solomon codes using Euclid’s algorithm and the fast fourier transform over GF(2m) , 2007, TENCON 2007 - 2007 IEEE Region 10 Conference.

[15]  Ramakant Nevatia,et al.  Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[16]  Feng Han,et al.  Discovering class specific composite features through discriminative sampling with Swendsen-Wang Cut , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[18]  E. Rückert Detecting Pedestrians by Learning Shapelet Features , 2007 .

[19]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, CVPR.

[20]  Luc Van Gool,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[21]  Larry S. Davis,et al.  Hierarchical Part-Template Matching for Human Detection and Segmentation , 2007, 2007 IEEE 11th International Conference on Computer Vision.