Human Detection in Indoor Environments Using Multiple Visual Cues and a Mobile Robot

In order to deploy mobile robots in social environments like indoor buildings, they need to be provided with perceptual abilities to detect people. In the computer vision literature the most typical solution to this problem is based on background subtraction techniques, however, in the case of a mobile robot this is not a viable solution. This paper shows an approach to robustly detect people in indoor environments using a mobile platform. The approach uses a stereo vision system that yields a stereo pair from which a disparity image is obtained. From this disparity image, interesting objects or blobs are segmented using a region growing algorithm. Afterwards, a color segmentation algorithm is performed on each blob, searching for human skin color areas. Finally, a probabilistic classifier provides information to decide if a given skin region corresponds to a human. We test the approach by mounting the resulting system on a mobile robot that navigates in an office type indoor building. We test the system under real time operation and different illumination conditions. The results indicate human detection accuracies over 90% in our test.

[1]  Neeti A. Ogale,et al.  A survey of techniques for human detection from video , 2006 .

[2]  Horst Bischof,et al.  Human detection in groups using a fast mean shift procedure , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[3]  H. Sidenbladh,et al.  Detecting human motion with support vector machines , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[4]  W. Grimson Why Stereo Vision is Not Always About 3D Reconstruction , 1993 .

[5]  Roland Siegwart,et al.  Hybrid simultaneous localization and map building: a natural integration of topological and metric , 2003, Robotics Auton. Syst..

[6]  Reid G. Simmons,et al.  Recent progress in local and global traversability for planetary rovers , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[7]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Akira Utsumi,et al.  Human detection using geometrical pixel value structures , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[9]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[10]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 2000, International Journal of Computer Vision.

[11]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[12]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Jan Flusser,et al.  Pattern recognition by affine moment invariants , 1993, Pattern Recognit..

[14]  R.M. Haralick,et al.  Statistical and structural approaches to texture , 1979, Proceedings of the IEEE.

[15]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[16]  Kurt Konolige,et al.  Small Vision Systems: Hardware and Implementation , 1998 .

[17]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.