Knowledge-based disambiguation of hand gestures

An algorithm is presented in this paper to disambiguate sequences of hand gestures by utilizing their overall meaning. This algorithm relieves the traditional reliance on a potentially complex syntactical analysis step by sharing this burden with higher level processing. A minimal set of concurrent primitives are used to represent hand gestures, improving scalability while reducing the complexity of training and recognition. These gesture primitives are synthesized into concepts and associated with a knowledge base using an approximate graph matching technique to determine their overall meaning. Initial experimental results have shown that this technique is able to successfully disambiguate one or more hand gestures in sequences of 2-5 noisy gestures, resulting in an improved understanding of the overall meaning.

[1]  Ming Ouhyoung,et al.  A real-time continuous gesture recognition system for sign language , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[2]  Hocine Ouhaddi,et al.  3D Hand Gesture Tracking by Model Registration , 1999 .

[3]  Alex Pentland,et al.  Real-time American Sign Language recognition from video using hidden Markov models , 1995 .

[4]  Caroline Hummels,et al.  Meaningful gestures for human computer interaction: beyond hand postures , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[5]  Wolfram Burgard,et al.  Probabilistic Algorithms and the Interactive Museum Tour-Guide Robot Minerva , 2000, Int. J. Robotics Res..

[6]  室 章治郎 Michael R.Garey/David S.Johnson 著, "COMPUTERS AND INTRACTABILITY A guide to the Theory of NP-Completeness", FREEMAN, A5判変形判, 338+xii, \5,217, 1979 , 1980 .

[7]  Stan Sclaroff,et al.  An appearance-based framework for 3D hand shape classification and camera viewpoint estimation , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[8]  Dimitris N. Metaxas,et al.  Toward Scalability in ASL Recognition: Breaking Down Signs into Phonemes , 1999, Gesture Workshop.

[9]  Marie-Laure Mugnier,et al.  Knowledge Representation and Reasonings Based on Graph Homomorphism , 2000, ICCS.

[10]  Ming Ouhyoung,et al.  A sign language recognition system using hidden markov model and context sensitive search , 1996, VRST.

[11]  Stan Sclaroff,et al.  3D hand pose reconstruction using specialized mappings , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[12]  Dimitris N. Metaxas,et al.  Adapting hidden Markov models for ASL recognition by using three-dimensional computer vision methods , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[13]  Sharon C. Salveter Review of Conceptual structures: information processing in mind and machine by John F. Sowa. Addison-Wesley 1984. , 1986 .

[14]  Michael Vande Weghe,et al.  An architecture for gesture-based control of mobile robots , 1999, Proceedings 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human and Environment Friendly Robots with High Intelligence and Emotional Quotients (Cat. No.99CH36289).

[15]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[16]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[17]  Horst Bunke,et al.  Efficient Subgraph Isomorphism Detection: A Decomposition Approach , 2000, IEEE Trans. Knowl. Data Eng..

[18]  John F. Sowa,et al.  Conceptual Graphs: Draft Proposed American National Standard , 1999, ICCS.

[19]  Monica N. Nicolescu,et al.  Learning and interacting in human-robot domains , 2001, IEEE Trans. Syst. Man Cybern. Part A.