A Novel Human Detection Approach Based on Depth Map via Kinect

In this paper, a new method of human detection based on depth map from 3D sensor Kinect is proposed. First, the pixel filtering and context filtering are employed to roughly repair defects on the depth map due to information inaccuracy captured by Kinect. Second, a dataset consisting of depth maps with various indoor human poses is constructed as benchmark. Finally, by introducing Kirsch mask and three-value codes to Local Binary Pattern, a novel Local Ternary Direction Pattern (LTDP) feature descriptor is extracted and is used for human detection with SVM as classifier. The performance for the proposed approach is evaluated and compared with other five existing feature descriptors using the same SVM classifier. Experiment results manifest the effectiveness of the proposed approach.

[1]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[5]  Nikolaos Vidakis,et al.  Fast 3D Scene Object Detection and Real Size Estimation using Microsoft Kinect Sensor , 2012 .

[6]  Thomas S. Huang,et al.  Image processing , 1971 .

[7]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[8]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[9]  Alberto Broggi,et al.  Model-based validation approaches and matching techniques for automotive vision based pedestrian detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[10]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[11]  Milan Sonka,et al.  Image processing analysis and machine vision [2nd ed.] , 1999 .

[12]  Hironobu Fujiyoshi,et al.  Real-Time Human Detection Using Relational Depth Similarity Features , 2010, ACCV.

[13]  Ramakant Nevatia,et al.  Detection and Tracking of Multiple, Partially Occluded Humans by Bayesian Combination of Edgelet based Part Detectors , 2007, International Journal of Computer Vision.

[14]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[15]  Shiqi Yu,et al.  An attempt to pedestrian detection in depth images , 2011, 2011 Third Chinese Conference on Intelligent Visual Surveillance.

[16]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Sebastian Thrun,et al.  Real-time identification and localization of body parts from depth images , 2010, 2010 IEEE International Conference on Robotics and Automation.

[18]  James M. Rehg,et al.  CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Jake K. Aggarwal,et al.  Human detection using depth information by Kinect , 2011, CVPR 2011 WORKSHOPS.

[20]  Xuelong Li,et al.  Human Carrying Status in Visual Surveillance , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).