Development of head detection and tracking systems for visual surveillance

This paper proposes a technique for the detection of head nod and shake gestures based on eye tracking and head motion decision. The eye tracking step is divided into face detection and eye location. Here, we apply a motion segmentation algorithm that examines differences in moving people’s faces. This system utilizes a Hidden Markov Model-based head detection module that carries out complete detection in the input images, followed by the eye tracking module that refines the search based on a candidate list provided by the preprocessing module. The novelty of this paper is derived from differences in real-time input images, preprocessing to remove noises (morphological operators and so on), detecting edge lines and restoration, finding the face area, and cutting the head candidate. Moreover, we adopt a K-means algorithm for finding the head region. Real-time eye tracking extracts the location of eyes from the detected face region and is performed at close to a pair of eyes. After eye tracking, the coordinates of the detected eyes are transformed into a normalized vector of x-coordinate and y-coordinate. Head nod and shake detector uses three hidden Markov models (HMMs). HMM representation of the head detection can estimate the underlying HMM states from a sequence of face images. Head nod and shake can be detected by three HMMs that are adapted by a directional vector. The directional vector represents the direction of the head movement. The vector is HMMs for determining neutral as well as head nod and shake. These techniques are implemented on images, and notable success is notified.

[1]  Kyung-Yong Chung,et al.  Context and profile based cascade classifier for efficient people detection and safety care system , 2012, Multimedia Tools and Applications.

[2]  Alexander Zelinsky,et al.  3-D facial pose and gaze point estimation using a robust real-time tracking paradigm , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[3]  Shinjiro Kawato,et al.  Real-time detection of nodding and head-shaking by directly detecting and tracking the "between-eyes" , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[4]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[5]  Monson H. Hayes,et al.  Face detection and recognition using hidden Markov models , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[6]  Wen Gao,et al.  Face recognition under generic illumination based on harmonic relighting , 2005, Int. J. Pattern Recognit. Artif. Intell..

[7]  Petar M. Djuric,et al.  An MCMC sampling approach to estimation of nonstationary hidden Markov models , 2002, IEEE Trans. Signal Process..

[8]  Thomas S. Huang,et al.  Final Report To NSF of the Planning Workshop on Facial Expression Understanding , 1992 .

[9]  Lawrence Carin,et al.  Hidden Markov models for multiaspect target classification , 1999, IEEE Trans. Signal Process..

[10]  Hisham Othman,et al.  Low complexity 2-D Hidden Markov Model for face recognition , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[11]  Ioannis Pitas,et al.  Parallel Algorithms for Digital Image Processing, Computer Vision and Neural Networks , 1993 .

[12]  Qian Chen,et al.  3D head pose estimation without feature tracking , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[13]  James W. Davis,et al.  A perceptual user interface for recognizing head gesture acknowledgements , 2001, PUI '01.

[14]  L. Kilmartin,et al.  Accurate monitoring of human physical activity levels for medical diagnosis and monitoring using off-the-shelf cellular handsets , 2011, Personal and Ubiquitous Computing.

[15]  Ahmed M. Elgammal,et al.  Face detection in complex environments from color images , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[16]  Julio César Hernández Castro,et al.  A secure distance-based RFID identification protocol with an off-line back-end database , 2012, Personal and Ubiquitous Computing.

[17]  Gang Rong,et al.  A real-time head nod and shake detector using HMMs , 2003, Expert Syst. Appl..

[18]  Woontack Woo,et al.  IMAF: in situ indoor modeling and annotation framework on mobile phones , 2012, Personal and Ubiquitous Computing.

[19]  Quan Pan,et al.  Real-time head tracking system with an active camera , 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788).

[20]  Aneta Poniszewska-Maranda Conception approach of access control in heterogeneous information systems using UML , 2010, Telecommun. Syst..

[21]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[22]  Kentaro Toyama,et al.  “Look, Ma – No Hands!” Hands-Free Cursor Control with Real-Time 3D Face Tracking , 1998 .

[23]  Ashish Kapoor,et al.  A real-time head nod and shake detector , 2001, PUI '01.

[24]  Brigitte Meillon,et al.  Design and evaluation of a smart home voice interface for the elderly: acceptability and objection aspects , 2011, Personal and Ubiquitous Computing.

[25]  Ioannis Pitas,et al.  Digital Image Processing Algorithms , 1993 .

[26]  Kee-Wook Rim,et al.  Localized Approximation Method Using Inertial Compensation in WSNs , 2011, ACIIDS Posters.

[27]  Linda G. Shapiro,et al.  A new connected components algorithm for virtual memory computers , 1983, Comput. Vis. Graph. Image Process..

[28]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).