Evaluation of HMM training algorithms for letter hand gesture recognition

The paper introduces an application using computer vision for letter hand gesture recognition. A digital camera records a video stream of hand gestures. The hand is automatically segmented, the position of the hand centroid is calculated in each frame, and a trajectory of the hand is determined. After smoothing the trajectory, a sequence of angles of motion along the trajectory is calculated and quantized to form a discrete observation sequence. Hidden Markov models (HMMs) are used to recognize the letters. Baum Welch and Viterbi path counting algorithms are applied for training the HMMs. Our system recognizes all 26 letters from A to Z and the database contains 30 example videos of each letter gesture. We achieve an average recognition rate of about 90 percent. A motivation for the development of this system is to provide an alternate text input mechanism for camera enabled handheld devices, such as video mobile phones and PDAs.

[1]  Yangsheng Xu,et al.  Gesture interface: modeling and learning , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[2]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[3]  Brian C. Lovell,et al.  MIME: a gesture-driven computer interface , 2000, Visual Communications and Image Processing.

[4]  Paramvir Bahl,et al.  Recognition of handwritten word: First and second order hidden Markov model based approach , 1989, Pattern Recognit..

[5]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[6]  Andreas Stolcke,et al.  Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.