Human detection using depth and gray images

A method is presented for extracting pedestrian information from an image sequence taken by a monocular camera. The method makes use of hybrid sensing of depth and gray information and it is shown to work well in an indoor environment. A split-and-merge strategy is proposed to process depth data for object and human detection. Furthermore, human tracking and event detection are also presented to recognize simple behavior such as hand-shaking. This method does not use background subtraction, and therefore it is applicable for scenes taken from mobile platforms. Experimental results are presented to validate our approach.

[1]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[2]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[3]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Martin Herman,et al.  Head tracking using stereo , 2002, Machine Vision and Applications.

[5]  W. Eric L. Grimson,et al.  Similarity templates for detection and recognition , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Azriel Rosenfeld,et al.  Visual surveillance and monitoring , 1998 .

[7]  Ismail Haritaoglu W(4): A real-time system for detection and tracking of people and monitoring their activities , 1999 .

[8]  A. M. Tekalp,et al.  Multiple camera fusion for multi-object tracking , 2001, Proceedings 2001 IEEE Workshop on Multi-Object Tracking.

[9]  Pascal Fua,et al.  Modeling People Toward Vision-Based Understanding of a Person's Shape, Appearance, and Movement , 2001, Comput. Vis. Image Underst..

[10]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[11]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[12]  Liang Zhao,et al.  Stereo- and neural network-based pedestrian detection , 1999, Proceedings 199 IEEE/IEEJ/JSAI International Conference on Intelligent Transportation Systems (Cat. No.99TH8383).

[13]  Tomaso A. Poggio,et al.  A general framework for object detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[14]  Stephen J. McKenna,et al.  Tracking interacting people , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[15]  Aaron F. Bobick,et al.  Closed-world tracking , 1995, Proceedings of IEEE International Conference on Computer Vision.

[16]  Dariu Gavrila,et al.  Real-time object detection for "smart" vehicles , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.