Moments and Wavelets for Classification of Human Gestures Represented by Spatio-Temporal Templates

This paper reports a novel technique for classifying short-duration articulated object motion in video data. The motion is represented by a spatio-temporal template (STT), a view-based approach that collapses the temporal component into a static grey-scale image, so that no explicit sequence matching or temporal analysis is needed, and that reduces the characterization of the motion from a very high-dimensional space to a low-dimensional one. These templates are modified to be invariant to translation and scale. A two-dimensional, three-level dyadic wavelet transform applied to these templates yields one low-pass subimage and nine high-pass directional subimages. Histograms of the STTs and of the wavelet coefficients at different scales are compared to establish the significance of the available information for classification. To further reduce the feature space, the histograms of the STTs are represented by orthogonal Legendre moments, and the wavelet subbands are modelled by the parameters of a generalized Gaussian density (GGD): the shape factor and the standard deviation. Preliminary experiments show that the directional information in the wavelet subbands improves the histogram-based technique, and that using moments combined with GGD parameters improves performance while significantly reducing complexity compared with comparing the histograms directly.
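The two feature-reduction steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`legendre_moments`, `ggd_ratio`, `fit_ggd`), the mapping of histogram bins onto [-1, 1], and the moment-matching estimator for the GGD shape factor are all assumptions chosen for illustration, since the abstract does not say how the moments or the GGD parameters were computed. The GGD fit also assumes zero-mean, non-constant subband coefficients.

```python
import math


def legendre_moments(hist, order):
    """Approximate Legendre moments 0..order of a 1-D histogram.

    Assumption: bin centres are mapped onto [-1, 1], where the Legendre
    polynomials are orthogonal, and the integral is a midpoint sum.
    """
    n_bins = len(hist)
    moments = [0.0] * (order + 1)
    for i, h in enumerate(hist):
        x = -1.0 + (2 * i + 1) / n_bins          # bin centre in [-1, 1]
        p_prev, p_curr = 1.0, x                  # P_0(x), P_1(x)
        for n in range(order + 1):
            p_n = p_prev if n == 0 else p_curr
            # (2n+1)/2 * P_n(x) * h * dx, with dx = 2 / n_bins
            moments[n] += (2 * n + 1) / n_bins * p_n * h
            if n >= 1:
                # Bonnet recurrence: (n+1) P_{n+1} = (2n+1) x P_n - n P_{n-1}
                p_next = ((2 * n + 1) * x * p_curr - n * p_prev) / (n + 1)
                p_prev, p_curr = p_curr, p_next
    return moments


def ggd_ratio(beta):
    """(E|x|)^2 / E[x^2] for a zero-mean GGD with shape factor beta."""
    log_r = (2 * math.lgamma(2 / beta)
             - math.lgamma(1 / beta)
             - math.lgamma(3 / beta))
    return math.exp(log_r)


def fit_ggd(coeffs, lo=0.1, hi=10.0, iters=60):
    """Estimate (shape factor, standard deviation) by moment matching.

    Assumption: coefficients are zero-mean and not all zero. The ratio
    ggd_ratio(beta) increases with beta, so bisection on [lo, hi] works.
    """
    n = len(coeffs)
    mean_abs = sum(abs(c) for c in coeffs) / n
    mean_sq = sum(c * c for c in coeffs) / n
    target = mean_abs ** 2 / mean_sq
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if ggd_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi), math.sqrt(mean_sq)
```

Under these assumptions, a template would be described by a handful of Legendre moments of its histogram plus one (shape factor, standard deviation) pair per high-pass subband, which is far fewer values than the full histograms. A shape factor near 2 indicates roughly Gaussian coefficients, and a value near 1 indicates a Laplacian-like, more peaked distribution.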
