Hidden Markov models for spatio-temporal pattern recognition

The success of many real-world applications demonstrates that hidden Markov models (HMMs) are highly effective in one-dimensional pattern recognition problems such as speech recognition. Research is now focussed on extending HMMs to 2-D and possibly 3-D applications, which arise in gesture, face, and handwriting recognition. Although the HMM has become a major workhorse of the pattern recognition community, there are few analytical results that explain its remarkably good pattern recognition performance. There are also only a few theoretical principles to guide researchers in selecting topologies or in understanding how the model parameters contribute to performance. In this chapter, we address these issues and use simulated data to evaluate a number of alternatives to the traditional Baum-Welch algorithm for learning HMM parameters. We then compare the best of these strategies with Baum-Welch on a real hand gesture recognition system, in an attempt to develop insights into these fundamental aspects of learning.
