A Hierarchical Action Recognition System Applying Fisher Discrimination Dictionary Learning via Sparse Representation

In this paper, we propose a hierarchical action recognition system applying Fisher discrimination dictionary learning via sparse representation classifier. Feature vectors used to represent certain actions are first generated by employing local features extracted from motion field maps. Sparse representation classification (SRC) are then employed on those feature vectors, in which a structured dictionary for classification is learned applying Fisher discrimination dictionary learning (FDDL). We tested our algorithms on Weizmann human database and KTH human database, and compared the recognition rates with other modeling methods such as k-nearest neighbor. Results showed that the action recognition system applying FDDL can achieve better performance despite that the learning stage for the Fisher discrimination dictionary can converge within only several iterations.

[1]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[3]  Pietro Perona,et al.  Human action recognition by sequence of movelet codewords , 2002, Proceedings. First International Symposium on 3D Data Processing Visualization and Transmission.

[4]  Cristian Sminchisescu,et al.  Conditional Random Fields for Contextual Human Motion Recognition , 2005, ICCV.

[5]  Dimitris N. Metaxas,et al.  Handshapes and Movements: Multiple-Channel American Sign Language Recognition , 2003, Gesture Workshop.

[6]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  A. Hampapur,et al.  Smart video surveillance: exploring the concept of multiscale spatiotemporal tracking , 2005, IEEE Signal Processing Magazine.

[8]  Hongying Meng,et al.  A Human Action Recognition System for Embedded Computer Vision Application , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Mubarak Shah,et al.  Learning human actions via information maximization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[11]  Michitaka Hirose,et al.  A pilot study on virtual camera control via Steady-State VEP in immersing virtual environments , 2008 .

[12]  Tadashi Shibata,et al.  A gesture perception algorithm using compact one-dimensional representation of spatio-temporal motion-field patches , 2009, 2009 3rd International Conference on Signal Processing and Communication Systems.

[13]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2008, International Journal of Computer Vision.

[14]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[15]  David Zhang,et al.  Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[16]  Deva Ramanan,et al.  Learning to parse images of articulated bodies , 2006, NIPS.

[17]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[18]  Sebastian Nowozin,et al.  Discriminative Subsequence Mining for Action Classification , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[19]  C. Koch,et al.  Invariant visual representation by single neurons in the human brain , 2005, Nature.

[20]  Thomas Serre,et al.  A Biologically Inspired System for Action Recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[21]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, ICPR 2004.

[22]  Antonio Camurri,et al.  Gesture-Based Communication in Human-Computer Interaction , 2003, Lecture Notes in Computer Science.

[23]  Tadashi Shibata,et al.  Spatio-temporal motion field descriptors for the hierarchical action recognition system , 2011, 2011 5th International Conference on Signal Processing and Communication Systems (ICSPCS).

[24]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[25]  Javier de Diego,et al.  Proceedings oh the International Congress of Mathematicians: Madrid, August 22-30,2006 : invited lectures , 2006 .