Kinect‐Based Pedestrian Detection for Crowded Scenes

Pedestrian movement data including volumes, walking speeds, and trajectories are essential in transportation engineering, planning, and research. Although traditional image-based pedestrian detectors provide very rich information, their performance degrades quickly with increased occurrence of occlusion. The three-dimensional sensing capabilities of Microsoft's Kinect present a potential cost-effective solution for occlusion-robust pedestrian detection. This article proposes an efficient pedestrian detection approach for crowded scenes by fusing RGB and depth images from the Kinect. More specifically, we first extract the pedestrian contour regions from RGB images using background subtraction. Then, we develop a region clustering algorithm to extract pedestrians from the contour regions using depth information. Finally, a tracking and counting algorithm is designed to acquire pedestrian volumes. The proposed approach was proven effective with an average detection accuracy of 93.1% at 20 frames per second. These results demonstrate the feasibility of using the low-cost Kinect device for real-world pedestrian detection in crowded scenes.

[1]  Kai Oliver Arras,et al.  People tracking in RGB-D data with on-line boosted target models , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Yinhai Wang,et al.  Model‐Free Video Detection and Tracking of Pedestrians and Bicyclists , 2009, Comput. Aided Civ. Infrastructure Eng..

[4]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[5]  Carlo Ratti,et al.  Kinects and human kinetics: a new approach for studying pedestrian behavior , 2014 .

[6]  Shiqi Yu,et al.  An attempt to pedestrian detection in depth images , 2011, 2011 Third Chinese Conference on Intelligent Visual Surveillance.

[7]  Jake K. Aggarwal,et al.  Human detection using depth information by Kinect , 2011, CVPR 2011 WORKSHOPS.

[8]  Matteo Munaro,et al.  Fast and Robust Multi-people Tracking from RGB-D Data for a Mobile Robot , 2012, IAS.

[9]  Tarak Gandhi,et al.  Pedestrian Protection Systems: Issues, Survey, and Challenges , 2007, IEEE Transactions on Intelligent Transportation Systems.

[10]  L-Q Xu,et al.  Segmentation and Tracking of Multiple Moving Objects for Intelligent Video Analysis , 2004 .

[11]  Kai Oliver Arras,et al.  People detection in RGB-D data , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  J. Pucher,et al.  Cycling to work in 90 large American cities: new evidence on the role of bike paths and lanes , 2012 .

[13]  Fatemeh Tahavori,et al.  A quantitative assessment of using the Kinect for Xbox 360 for respiratory surface motion tracking , 2012, Medical Imaging.

[14]  Carlo Ratti,et al.  Kinects and Human Kinetics: A New Approach for Studying Crowd Behavior , 2012, ArXiv.

[15]  Nicolas Saunier,et al.  Analysis of Pedestrian Travel with Static Bluetooth Sensors , 2012 .

[16]  Keiichi Abe,et al.  Topological structural analysis of digitized binary images by border following , 1985, Comput. Vis. Graph. Image Process..

[17]  Kwang Hee Won,et al.  Pedestrian detection using labeled depth data , 2013, The 19th Korea-Japan Joint Workshop on Frontiers of Computer Vision.

[18]  Cristiano Premebida,et al.  Pedestrian detection in far infrared images , 2013, Integr. Comput. Aided Eng..

[19]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Ricardo Gutierrez-Osuna,et al.  Web GIS in practice X: a Microsoft Kinect natural user interface for Google Earth navigation , 2011, International journal of health geographics.

[21]  Silvio Savarese,et al.  Detecting and tracking people using an RGB-D camera via multiple detector fusion , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[22]  Carlo Tomasi,et al.  People Detection Using Color and Depth Images , 2011, MCPR.

[23]  Song Wu,et al.  A Configurable Agent‐Based Crowd Model with Generic Behavior Effect Representation Mechanism , 2014, Comput. Aided Civ. Infrastructure Eng..

[24]  A. Dannenberg,et al.  Walking and cycling to health: a comparative analysis of city, state, and international data. , 2010, American journal of public health.

[25]  Alain Trémeau,et al.  A region growing and merging algorithm to color segmentation , 1997, Pattern Recognit..

[26]  Ying Cui,et al.  Real-time human detection and tracking in complex environments using single RGBD camera , 2013, 2013 IEEE International Conference on Image Processing.

[27]  Jennifer Dill Bicycling for Transportation and Health: The Role of Infrastructure , 2009, Journal of public health policy.

[28]  Ching-Tang Hsieh,et al.  A Kinect-based people-flow counting system , 2012, 2012 International Symposium on Intelligent Signal Processing and Communications Systems.

[29]  Michael Hayes,et al.  Altitude control of a quadrotor helicopter using depth map from Microsoft Kinect sensor , 2011, 2011 IEEE International Conference on Mechatronics.

[30]  Erik D. Goodman,et al.  Integrating a statistical background- foreground extraction algorithm and SVM classifier for pedestrian detection and tracking , 2013, Integr. Comput. Aided Eng..