Vision-based people detection using depth information for social robots

Robots are starting to be applied in areas which involve sharing space with humans. In particular, social robots and people will coexist closely because the former are intended to interact with the latter. In this context, it is crucial that robots are aware of the presence of people around them. Traditionally, people detection has been performed using a flow of two-dimensional images. However, in nature, animals’ sight perceives their surroundings using color and depth information. In this work, we present new people detectors that make use of the data provided by depth sensors and red-green-blue images to deal with the characteristics of human–robot interaction scenarios. These people detectors are based on previous works using two-dimensional images and existing people detectors from different areas. The disparity of the input and output data used by these types of algorithms usually complicates their integration into robot control architectures. We propose a common interface that can be used by any people detector, resulting in numerous advantages. Several people detectors using depth information and the common interface have been implemented and evaluated. The results show a great diversity among the different algorithms. Each one has a particular domain of use, which is reflected in the results. A clever combination of several algorithms appears as a promising solution to achieve a flexible, reliable people detector.

[1]  Huosheng Hu,et al.  Multisensor-Based Human Detection and Tracking for Mobile Service Robots , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[3]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Gary R. Bradski,et al.  Learning OpenCV - computer vision with the OpenCV library: software that sees , 2008 .

[7]  Xuelong Li,et al.  Efficient HOG human detection , 2011, Signal Process..

[8]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[10]  Gregory Dudek,et al.  A Visual Language for Robot Control and Programming: A Human-Interface Study , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[11]  Ana Paiva,et al.  Modelling empathic behaviour in a robotic game companion for children: An ethnographic study in real-world settings , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[12]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[13]  Nico Blodow,et al.  Partial view modeling and validation in 3D laser scans for grasping , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[14]  Laura Igual,et al.  Robust gait-based gender classification using depth cameras , 2013, EURASIP Journal on Image and Video Processing.

[15]  Arnaud Ramey Local user mapping via multi-modal fusion for social robots , 2016 .

[16]  Sung-Jea Ko,et al.  Real-time human tracking using fusion sensor for home security robot , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[17]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[18]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[19]  A. Howard,et al.  Detecting Pedestrians with Stereo Vision: Safe Operation of Autonomous Ground Vehicles in Dynamic Environments , 2007 .

[20]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Bernt Schiele,et al.  Taking a deeper look at pedestrians , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[23]  María Malfaz,et al.  FAST 3D CLUSTER TRACKING FOR A MOBILE ROBOT USING 2D TECHNIQUES ON DEPTH IMAGES , 2013, Cybern. Syst..

[24]  Antal K. Bejczy,et al.  Distributed microcomputer control system for advanced teleoperation , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[25]  Stefano Tubaro,et al.  Deep Convolutional Neural Networks for pedestrian detection , 2015, Signal Process. Image Commun..

[26]  Antonio Fernández-Caballero,et al.  Real-Time Accumulative Computation Motion Detectors , 2009, Sensors.

[27]  Chrystopher L. Nehaniv,et al.  An empirical framework for human-robot proxemics , 2009 .

[28]  Víctor González-Pacheco,et al.  Integration of a low-cost RGB-D sensor in a social robot for gesture recognition , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[29]  Lucian Cosmin Goron,et al.  Reconstruction and Verification of 3D Object Models for Grasping , 2009, ISRR.

[30]  Luca Maria Gambardella,et al.  Fast image scanning with deep max-pooling convolutional neural networks , 2013, 2013 IEEE International Conference on Image Processing.

[31]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[32]  Bogdan Kwolek,et al.  Human Fall Detection by Mean Shift Combined with Depth Connected Components , 2012, ICCVG.

[33]  Basilio Sierra,et al.  RGB-D, Laser and Thermal Sensor Fusion for People following in a Mobile Robot , 2013 .