Human Action Recognition with Hierarchical Growing Neural Gas Learning

We propose a novel biologically inspired framework for the recognition of human full-body actions. First, we extract body pose and motion features from depth map sequences. We then cluster pose-motion cues with a two-stream hierarchical architecture based on growing neural gas (GNG). Multi-cue trajectories are finally combined to provide prototypical action dynamics in the joint feature space. We extend the unsupervised GNG with two labelling functions for classifying clustered trajectories. Noisy samples are automatically detected and removed from the training and the testing set. Experiments on a set of 10 human actions show that the use of multi-cue learning leads to substantially increased recognition accuracy over the single-cue approach and the learning of joint pose-motion vectors.

[1]  Ran Xu,et al.  Combining Skeletal Pose with Local Motion for Human Activity Recognition , 2012, AMDO.

[2]  Petros Daras,et al.  Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data , 2014, MMM.

[3]  Robin R. Murphy,et al.  Hand gesture recognition with depth images: A review , 2012, 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication.

[4]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[5]  T. Poggio,et al.  Cognitive neuroscience: Neural mechanisms for the recognition of biological movements , 2003, Nature Reviews Neuroscience.

[6]  Pierre Kornprobst,et al.  Action Recognition with a Bio-inspired Feedforward Motion Processing Model: The Richness of Center-Surround Interactions , 2008, ECCV.

[7]  Larry S. Davis,et al.  Recognizing Human Actions by Learning and Matching Shape-Motion Prototype Trees , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Chong-Wah Ngo,et al.  Multimedia modeling , 2013, Multimedia Tools and Applications.

[9]  Stefan Wermter,et al.  FINGeR: Framework for interactive neural-based gesture recognition , 2014, ESANN.

[10]  Philipp Cimiano,et al.  Online Labelling Strategies for Growing Neural Gas , 2011, IDEAL.

[11]  Andrew J. Davison,et al.  Active Matching , 2008, ECCV.

[12]  Victor J. Rayward-Smith,et al.  Intelligent Data Engineering and Automated Learning - IDEAL 2011 , 2011, Lecture Notes in Computer Science.

[13]  Stefan Wermter,et al.  Hierarchical SOM-based detection of novel behavior for 3D human tracking , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[14]  H. Offret Computational Maps in the Visual Cortex , 2006 .

[15]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[16]  Abdenour Bouzouane,et al.  POSSIBILISTIC ACTIVITY RECOGNITION IN SMART HOMES FOR COGNITIVELY IMPAIRED PEOPLE , 2011, Appl. Artif. Intell..