Kernel-based sparse representation for gesture recognition

In this paper, we propose a novel sparse representation based framework for classifying complicated human gestures captured as multi-variate time series (MTS). The novel feature extraction strategy, CovSVDK, can overcome the problem of inconsistent lengths among MTS data and is robust to the large variability within human gestures. Compared with PCA and LDA, the CovSVDK features are more effective in preserving discriminative information and are more efficient to compute over large-scale MTS datasets. In addition, we propose a new approach to kernelize sparse representation. Through kernelization, realized dictionary atoms are more separable for sparse coding algorithms and nonlinear relationships among data are conveniently transformed into linear relationships in the kernel space, which leads to more effective classification. Finally, the superiority of the proposed framework is demonstrated through extensive experiments.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Cyrus Shahabi,et al.  A PCA-based similarity measure for multivariate time series , 2004, MMDB '04.

[3]  George C. Runger,et al.  A Bag-of-Features Framework to Classify Time Series , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[5]  W. Krzanowski Between-Groups Comparison of Principal Components , 1979 .

[6]  Eamonn J. Keogh,et al.  A Complexity-Invariant Distance Measure for Time Series , 2011, SDM.

[7]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[8]  Manolya Kavakli,et al.  Hand Gesture Recognition Based on Segmented Singular Value Decomposition , 2010, KES.

[9]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[10]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[11]  Shandong Wu,et al.  On Signature Invariants for Effective Motion Trajectory Recognition , 2008, Int. J. Robotics Res..

[12]  Dan Schonfeld,et al.  Object Trajectory-Based Activity Classification and Recognition Using Hidden Markov Models , 2007, IEEE Transactions on Image Processing.

[13]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[14]  Junyi Shen,et al.  Classification of multivariate time series using two-dimensional singular value decomposition , 2008, Knowl. Based Syst..

[15]  Yi Li,et al.  Learning shift-invariant sparse representation of actions , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  J. CandesE.,et al.  Robust uncertainty principles , 2006 .

[17]  Vladimir Pavlovic,et al.  Discriminative Learning of Mixture of Bayesian Network Classifiers for Sequence Classification , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[18]  E. Candes,et al.  11-magic : Recovery of sparse signals via convex programming , 2005 .

[19]  Kenneth E. Barner,et al.  Hybrid feature selection for gesture recognition using support vector machines , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  Aaron F. Bobick,et al.  Performance Analysis of Time-Distance Gait Parameters under Different Speeds , 2003, AVBPA.

[21]  Dimitrios Gunopulos,et al.  Indexing multi-dimensional time-series with support for multiple distance measures , 2003, KDD '03.

[22]  Kenneth E. Barner,et al.  An enhanced sparse representation strategy for signal classification , 2012, Defense + Commercial Sensing.

[23]  Baoxin Li,et al.  Discriminative K-SVD for dictionary learning in face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  B. Prabhakaran,et al.  Segmentation and recognition of multi-attribute motion sequences , 2004, MULTIMEDIA '04.

[25]  Eamonn J. Keogh,et al.  Making Time-Series Classification More Accurate Using Learned Constraints , 2004, SDM.

[26]  Rawesak Tanawongsuwan,et al.  Characteristics of Time-Distance Gait Parameters Across Speeds , 2003 .

[27]  Liang-Tien Chia,et al.  Kernel Sparse Representation for Image Classification and Face Recognition , 2010, ECCV.

[28]  Ting Wang,et al.  Kernel Sparse Representation-Based Classifier , 2012, IEEE Transactions on Signal Processing.

[29]  Radu-Daniel Vatavu,et al.  Gestures as point clouds: a $P recognizer for user interface prototypes , 2012, ICMI '12.

[30]  Deanna Needell,et al.  CoSaMP: Iterative signal recovery from incomplete and inaccurate samples , 2008, ArXiv.

[31]  B. Prabhakaran,et al.  Segmentation and recognition of motion streams by similarity search , 2007, TOMCCAP.

[32]  Yukie Ikedaa,et al.  Knowledge Based and Intelligent Information and Engineering Systems An Evacuation Route Planning for Safety Route Guidance System after Natural Disaster Using Multi-Objective Genetic Algorithm , 2016 .

[33]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[35]  C. Shahabi,et al.  A PCA-based Kernel for Kernel PCA on Multivariate Time Series , 2005 .