Classifying Hand Gestures with a View-Based Distributed Representation

We present a method for learning, tracking, and recognizing human hand gestures recorded by a conventional CCD camera without any special gloves or other sensors. A view-based representation is used to model aspects of the hand relevant to the trained gestures, and is found using an unsupervised clustering technique. We use normalized correlation networks, with dynamic time warping in the temporal domain, as a distance function for unsupervised clustering. Views are computed separably for space and time dimensions; the distributed response of the combination of these units characterizes the input data with a low dimensional representation. A supervised classification stage uses labeled outputs of the spatio-temporal units as training data. Our system can correctly classify gestures in real time with a low-cost image processing accelerator.

[1]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[2]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Yasuhito Suenaga,et al.  Real-Time Detection of Pointing Actions for a Glove-Free Interface , 1992, MVA.

[4]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[5]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[6]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[7]  Roberto Cipolla,et al.  Qualitative Visual Interpretation of 3d Hand Gestures Using Motion Parallax , 1992, MVA.

[8]  Fumio Kishino,et al.  Real time hand shape recognition using pipe-line image processor , 1992, [1992] Proceedings IEEE International Workshop on Robot and Human Communication.

[9]  Tomaso Poggio,et al.  Example Based Image Analysis and Synthesis , 1993 .

[10]  Thomas M. Breuel,et al.  View-Based Recognition , 1992, MVA.

[11]  Hiroshi Murase,et al.  Learning and recognition of 3D objects from appearance , 1993, [1993] Proceedings IEEE Workshop on Qualitative Vision.