On the Selection of a Classification Technique for the Representation and Recognition of Dynamic Gestures

Previous evaluations of gesture recognition techniques have been focused on classification performance, while ignoring other relevant issues such as knowledge description, feature selection, error distribution and learning performance. In this paper, we present an empirical comparison of decision trees, neural networks and hidden Markov models for visual gesture recognition following these criteria. Our results show that none of these techniques is a definitive alternative for all these issues. While neural nets and hidden Markov models show the highest recognition rates, they sacrifice clarity of its knowledge; decision trees, on the other hand, are easy to create and analyze. Moreover, error dispersion is higher with neural nets. This information could be useful to develop a general computational theory of gestures. For the experiments, a database of 9 gestures with more than 7000 samples taken from 15 people was used.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  H.H. Aviles-Arriaga,et al.  Visual recognition of gestures using dynamic naive Bayesian classifiers , 2003, The 12th IEEE International Workshop on Robot and Human Interactive Communication, 2003. Proceedings. ROMAN 2003..

[3]  Abdolhossein Sarrafzadeh,et al.  Modeling and Recognition of Gesture Signals in 2D Space: A Comparison of NN and SVM Approaches , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[4]  R.J.J.H. van Son The relation between the error distribution and the error rate in identification experiments , 1995 .

[5]  Alex Pentland,et al.  Invariant features for 3-D gesture recognition , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[6]  R. Watson A Survey of Gesture Recognition Techniques , 1993 .

[7]  Michael G. Strintzis,et al.  A gesture recognition system using 3D data , 2002, Proceedings. First International Symposium on 3D Data Processing Visualization and Transmission.

[8]  Horst-Michael Groß,et al.  Implementation and comparison of three architectures for gesture recognition , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  Kanti V. Mardia,et al.  Techniques for online gesture recognition on workstations , 1993, Image Vis. Comput..

[10]  David G. Stork,et al.  Pattern Classification , 1973 .

[11]  R. Watson A Survey of Gesture RecognitionTechniques. , 1993 .

[12]  Alistair Sutherland,et al.  Dynamic gesture recognition using PCA with multiscale theory and HMM , 2001, International Symposium on Multispectral Image Processing and Pattern Recognition.

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  Ian Witten,et al.  Data Mining , 2000 .

[15]  C. Creider Hand and Mind: What Gestures Reveal about Thought , 1994 .

[16]  Luis Enrique Sucar,et al.  Visual Recognition of Similar Gestures , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[17]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[18]  James W. Davis,et al.  The representation and recognition of human movement using temporal templates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[20]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[21]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[22]  Yangsheng Xu,et al.  Hidden Markov Model for Gesture Recognition , 1994 .

[23]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[24]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..