Head Detection in Stereo Data for People Counting and Segmentation

In this paper we propose a head detection method using range data from a stereo camera. The method is based on a technique that has been introduced in the domain of voxel data. For application in stereo cameras, the technique is extended (1) to be applicable to stereo data, and (2) to be robust with regard to noise and variation in environmental settings. The method consists of foreground selection, head detection, and blob separation, and, to improve results in case of misdetections, incorporates a means for people tracking. It is tested in experiments with actual stereo data, gathered from three distinct real-life scenarios. Experimental results show that the proposed method performs well in terms of both precision and recall. In addition, the method was shown to perform well in highly crowded situations. From our results, we may conclude that the proposed method provides a strong basis for head detection in applications that utilise stereo cameras.

[1]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 2000, International Journal of Computer Vision.

[2]  T. Izumi,et al.  Improvement of Head Extraction for Height Measurement by Combination of Sphere Matching and Optical Flow , 2006, 2006 SICE-ICASE International Joint Conference.

[3]  Mohan M. Trivedi,et al.  Human Body Model Acquisition and Tracking Using Voxel Data , 2003, International Journal of Computer Vision.

[4]  Manabu Hashimoto,et al.  Multiple-person tracker with a fixed slanting stereo camera , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[5]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[6]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[7]  Jake K. Aggarwal,et al.  Head segmentation and head orientation in 3D space for pose estimation of multiple people , 2000, 4th IEEE Southwest Symposium on Image Analysis and Interpretation.

[8]  David Beymer,et al.  Person counting using stereo , 2000, Proceedings Workshop on Human Motion.

[9]  Ramakant Nevatia,et al.  Bayesian human segmentation in crowded situations , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[10]  Liyuan Li,et al.  Stereo-based human head detection from crowd scenes , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[11]  Alan F. Smeaton,et al.  Robust pedestrian detection and tracking in crowded scenes , 2009, Image Vis. Comput..

[12]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[13]  Kazuhiko Yamamoto,et al.  Face and head detection for a real-time surveillance system , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[14]  Christian Wöhler,et al.  Motion-based recognition of pedestrians , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[15]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[16]  Yan Guo,et al.  Real-time stereo tracking of multiple moving heads , 2001, Proceedings IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems.