View-invariant, partially occluded human detection in still images using part bases and random forest

Abstract. This paper presents a part-based human detection method that is invariant to variations in the view of the human and partial occlusion by other objects. First, to address the view variance, parts are extracted from three views: frontal-rear, left profile, and right profile. Then a random set of rectangular parts are extracted from the upper, middle, and lower body as the distribution of Gaussian. Second, an individual part classifier is constructed using random forests across all parts extracted from the three views. From the part locations of each view, part vectors (PVs) are generated and part bases (PB) are also formalized by clustering PVs with their weights of each PB. For testing, a PV for the frontal-rear view is estimated using trained part detectors and is then applied to the trained PB for each view class. Then the distance is computed between the PB and PVs. After applying the same process to the other two views, the final human and its view having the minimum score are selected. The proposed method is applied to pedestrian datasets and its detection precision is, on average, 0.14 higher than related methods, while achieving a faster or comparable processing time with an average of 1.85 s per image.

[1]  Ramakant Nevatia,et al.  Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[2]  Hyeran Byun,et al.  Method to improve efficiency of human detection using scalemap , 2014 .

[3]  ByoungChul Ko,et al.  Fast Human Detection for Intelligent Monitoring Using Surveillance Visible Sensors , 2014, Sensors.

[4]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Xudong Jiang,et al.  Human Detection by Quadratic Classification on Subspace of Extended Histogram of Gradients , 2014, IEEE Transactions on Image Processing.

[6]  Hyeran Byun,et al.  Robust Face Detection and Tracking for Real-Life Applications , 2003, Int. J. Pattern Recognit. Artif. Intell..

[7]  Shihong Lao,et al.  Multiple Human Tracking Based on Multi-view Upper-Body Detection and Discriminative Learning , 2010, 2010 20th International Conference on Pattern Recognition.

[8]  Subhransu Maji,et al.  Detecting People Using Mutually Consistent Poselet Activations , 2010, ECCV.

[9]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[11]  Cordelia Schmid,et al.  Learning to Parse Pictures of People , 2002, ECCV.

[12]  Grantham Pang,et al.  Human detection in crowded scenes , 2010, 2010 IEEE International Conference on Image Processing.

[13]  Miley W. Merkhofer,et al.  An Evaluation of the State of the Art , 1993 .

[14]  Shuicheng Yan,et al.  Discriminative local binary patterns for human detection in personal album , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  ByoungChul Ko,et al.  Spatiotemporal bag-of-features for early wildfire smoke detection , 2013, Image Vis. Comput..

[16]  Yi Yang,et al.  Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Ramakant Nevatia,et al.  Cluster Boosted Tree Classifier for Multi-View, Multi-Pose Object Detection , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[18]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Wanqing Li,et al.  A part-based template matching method for multi-view human detection , 2009, 2009 24th International Conference Image and Vision Computing New Zealand.

[20]  Stefan Roth,et al.  People-tracking-by-detection and people-detection-by-tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Vittorio Murino,et al.  Part-based human detection on Riemannian manifolds , 2010, 2010 IEEE International Conference on Image Processing.

[22]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[24]  Raphaël Marée,et al.  Random subwindows for robust image classification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25]  Byoung Chul Ko,et al.  Three-level cascade of random forests for rapid human detection , 2013 .

[26]  ByoungChul Ko,et al.  Human tracking in thermal images using adaptive particle filters with online random forest learning , 2013 .

[27]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[28]  Silvio Savarese,et al.  Articulated part-based model for joint object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[29]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[30]  Seong-Hoon Kim,et al.  X-ray Image Classification Using Random Forests with Local Wavelet-Based CS-Local Binary Patterns , 2011, Journal of Digital Imaging.

[31]  Leonidas J. Guibas,et al.  Human action recognition by learning bases of action attributes and parts , 2011, 2011 International Conference on Computer Vision.