Human head detection using multi-modal object features

This paper describes a neural network system that automatically detects whether a human head exists in a given image. We focus our research in the first two levels of head detections. At the first level, it extracts candidates of a head using range information, motion clue and 3D spherical shape. At the second level, the system uses multiple visual modalities including gray scale value distribution, shape, motion and range information obtained using a stereo vision system to represent head features. A neural network classifier is used to evaluate the effectiveness of various object features for generating and representing human head. The system is validated on a large collection of images taken from a stereo camera system mounted inside a vehicle. Our experiments show the presented system has an accurate rate over 96%.

[1]  Andrew W. Fitzgibbon,et al.  Direct Least Square Fitting of Ellipses , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[3]  Martin Herman,et al.  Head tracking using stereo , 2002, Machine Vision and Applications.

[4]  Alex Pentland,et al.  Modeling, tracking and interactive animation of faces and heads//using input from video , 1996, Proceedings Computer Animation '96.

[5]  Kurt Konolige,et al.  Small Vision Systems: Hardware and Implementation , 1998 .

[6]  Rama Chellappa,et al.  Human and machine recognition of faces: a survey , 1995, Proc. IEEE.

[7]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Liang Zhao,et al.  Stereo- and neural network-based pedestrian detection , 1999, Proceedings 199 IEEE/IEEJ/JSAI International Conference on Intelligent Transportation Systems (Cat. No.99TH8383).

[9]  Nader Bagherzadeh,et al.  Finding elliptical shapes in an image using a pyramid architecture , 1995 .

[10]  S. Birchfield,et al.  An elliptical head tracker , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).