A Real-Time Demonstrator for Video-Based Recognition of Dynamic Head Gestures, Using Discrete Hidden Markov Models

This work describes a powerful demonstrator of a videobased approach for detecting and classifying dynamic head gestures. The head of the user is localized via a combination of colorand shape-based segmentation. For a continuous feature extraction, we use a template matching of the nose bridge in combination with selected features derived from the optical flow. The core classification unit consists of discrete Hidden Markov Models (DHMMs). We extensively tested the system in two different domains (desktop VirtualReality and automotive environment). In the current state of development, six different gestures can be classified with an overall recognition rate of 97.3% in the VR, and 95.5% in the automotive environment, respectively. The approach works absolutely independent from the image background and additional gesture types can easily be integrated.

[1]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[3]  Peter Morguet,et al.  Spotting dynamic hand gestures in video image sequences using hidden Markov models , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[4]  Gerhard Rigoll,et al.  Evaluating Multimodal Interaction Patterns in Various Application Scenarios , 2003, Gesture Workshop.

[5]  Ryohei Nakatsu,et al.  A Head Gesture Recognition Algorithm , 2000, ICMI.

[6]  Larry S. Davis,et al.  Recognition of head gestures using hidden Markov models , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Sharon L. Oviatt,et al.  Multimodal interface research: a science without borders , 2000, INTERSPEECH.

[9]  James W. Davis,et al.  A perceptual user interface for recognizing head gesture acknowledgements , 2001, PUI '01.