Gesture interface: modeling and learning

This paper presents a method for developing a gesture-based system using a multidimensional hidden Markov model (HMM). Instead of using geometric features, gestures are converted into sequential symbols. HMMs are employed to represent the gestures, and their parameters are learned from the training data. Based on the "most likely performance" criterion, a gesture is recognized by evaluating the trained HMMs and selecting the most likely one. We have developed a prototype to demonstrate the feasibility of the proposed method. The system achieved 99.78% accuracy on a nine-gesture isolated recognition task. Encouraging results were also obtained from experiments on continuous gesture recognition. The proposed method is applicable to any gesture represented as a multidimensional signal, and will be a valuable tool in telerobotics and human-computer interfacing.
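The recognition step described above, scoring an observed symbol sequence against one HMM per gesture and choosing the most likely model, can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a one-dimensional discrete HMM rather than the multidimensional model, and the gesture names (`swipe`, `circle`), the three-symbol alphabet, and all probabilities are hypothetical values set by hand rather than learned from training data.

```python
import math

def log_likelihood(obs, pi, A, B):
    """Scaled forward algorithm: returns log P(obs | model) for a discrete HMM.

    pi[i]   : initial probability of state i
    A[i][j] : transition probability from state i to state j
    B[i][k] : probability that state i emits symbol k
    """
    n = len(pi)
    # Initialisation: alpha_1(i) = pi_i * b_i(o_1)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Induction: alpha_t(j) = (sum_i alpha_{t-1}(i) * a_ij) * b_j(o_t)
            alpha = [
                sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        # Rescale to avoid underflow; the log of the scales sums to log P
        alpha = [a / scale for a in alpha]
        log_p += math.log(scale)
    return log_p

def recognize(obs, models):
    """Return the name of the model with the highest log-likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

# Hypothetical two-state left-to-right models over symbols {0, 1, 2}.
LEFT_RIGHT = [[0.7, 0.3], [0.0, 1.0]]
MODELS = {
    # "swipe" tends to emit symbol 0, then symbol 1
    "swipe": ([1.0, 0.0], LEFT_RIGHT, [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    # "circle" tends to emit symbol 2, then symbol 0
    "circle": ([1.0, 0.0], LEFT_RIGHT, [[0.1, 0.1, 0.8], [0.8, 0.1, 0.1]]),
}

if __name__ == "__main__":
    print(recognize([0, 0, 1, 1], MODELS))
    print(recognize([2, 2, 0, 0], MODELS))
```

In a full system the symbol sequences would come from quantizing the raw gesture signal, and the model parameters would be estimated from training examples rather than written by hand.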
