Graph fourier transform based descriptor for gesture classification

This paper proposes a method for gesture classification based on Graph Fourier transform (GFT) coefficients. GFT coefficients are the projection of image pixel block onto the eigenvectors of a Laplacian matrix. This Laplacian matrix is generated from undirected graph, representing a spatial connectedness between each pixel within an image block. This work proposes a method for generating an undirected graph by using edge information of the image. Edge information of the image is obtained by average sum of absolute difference between the current pixel and its neighboring pixels by using an appropriate threshold. The resulting GFT based feature vector is formed by concatenating GFT coefficients of each block. The resultant feature vector is applied to linear Support Vector Machine (SVM) classifier to predict the gesture class. For NTU and Massey hand gesture datasets, threshold value 30 gives maximum prediction accuracy. We compare the results of the proposed GFT based descriptor approach with Karhunen-Loeve transform (K-LT) and Discrete Cosine transform (DCT) based descriptors on three different gesture datasets: NTU, Cambridge and Massey. Simulation results show that the proposed GFT based descriptor gives a comparable results with Karhunen-Loeve transform (K-LT) and Discrete Cosine transform (DCT) based descriptors for gesture classification.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[3]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[4]  Longin Jan Latecki,et al.  Path Similarity Skeleton Graph Matching , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Wenqin Zhuang,et al.  Graph-based Sparse Representation for Image Denoising , 2015 .

[6]  Yael Edan,et al.  Vision-based hand-gesture applications , 2011, Commun. ACM.

[7]  Jan Flusser,et al.  Image Registration: A Survey and Recent Advances , 2005 .

[8]  Junsong Yuan,et al.  Robust Part-Based Hand Gesture Recognition Using Kinect Sensor , 2013, IEEE Transactions on Multimedia.

[9]  Mircea Nicolescu,et al.  Vision-based hand pose estimation: A review , 2007, Comput. Vis. Image Underst..

[10]  R. S. Jadon,et al.  A REVIEW OF VISION BASED HAND GESTURES RECOGNITION , 2009 .

[11]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[13]  Chengjun Liu,et al.  Gabor-based kernel PCA with fractional power polynomial models for face recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Lu Yang,et al.  Survey on 3D Hand Gesture Recognition , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Hong Zhu,et al.  A survey on feature extraction for pattern recognition , 2011, Artificial Intelligence Review.

[16]  Oscar C. Au,et al.  Multiresolution Graph Fourier Transform for Compression of Piecewise Smooth Images , 2015, IEEE Transactions on Image Processing.

[17]  S. Mitra,et al.  Gesture Recognition: A Survey , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[18]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Antonio Ortega,et al.  Gesture dynamics modeling for attitude analysis using graph based transform , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[20]  Tae-Kyun Kim,et al.  Canonical Correlation Analysis of Video Volume Tensors for Action Categorization and Detection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jaya Shukla,et al.  A Method for Hand Gesture Recognition , 2014, 2014 Fourth International Conference on Communication Systems and Network Technologies.

[22]  Napoleon H. Reyes,et al.  A New 2D Static Hand Gesture Colour Image Dataset for ASL Gestures , 2011 .

[23]  Fan Zhang,et al.  Graph spectral image smoothing using the heat kernel , 2008, Pattern Recognit..

[24]  Takeo Kanade,et al.  Use of Fourier and Karhunen-Loeve decomposition for fast pattern matching with a large set of templates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Haibin Ling,et al.  Shape Classification Using the Inner-Distance , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[27]  José M. F. Moura,et al.  Discrete Signal Processing on Graphs , 2012, IEEE Transactions on Signal Processing.

[28]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[29]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .