Constructive, robust and adaptive OS-ELM in human action recognition

We introduce a constructive, robust, and adaptive OS-ELM (Online Sequential Extreme Machine Learning) that combines learning strategies of Constructive Enhancement OSELM and Robust OS-ELM, and add adaptive capability to receive a new target class requirement during sequential learning and applied in human action recognition. The overall strategy is aimed to deal against parameters tuning and new requirements drift problem in the sequential learning process that commonly happened in human action recognition. Our proposed method has an automatic and systematic approach for determining input weight and bias value when the hidden nodes need to be increased to handle a larger training data size and to adjust the output weight when the new target class label is presented during sequential learning. We demonstrated the capability using 2013 Challenge on Multi modal Gesture Recognition open dataset (Chalearn 2013) with uni modal features (skeleton data) only. Our experiments using skeletons of upper body joints features with normalized euclidean distance and projection angle position coordinates to shoulder center and hip center, clustering analysis with k-means for pose based generation and Bag of Pose (BoP) for temporal sequential information. We developed the training sequence scenario to introduce the partial action classes in the initial training and the rest of training data with all classes in the next sequences to simulate the condition when the OS-ELM received new target class requirement drift. Our proposed method gives better accuracy plus adaptive capability compared with SUMO method which is the best uni modal method for chalearn data and still maintain a reasonably small computation time.

[1]  Zhiquan Wang,et al.  Recognition of human activities using SVM multi-class classifier , 2010, Pattern Recognit. Lett..

[2]  Yuan Lan,et al.  A constructive enhancement for Online Sequential Extreme Learning Machine , 2009, 2009 International Joint Conference on Neural Networks.

[3]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[4]  Giulio Paci,et al.  A Multi-scale Approach to Gesture Detection and Recognition , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[5]  Adriana Kovashka,et al.  Learning a hierarchy of discriminative space-time neighborhood features for human action recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Chee Kheong Siew,et al.  Incremental extreme learning machine with fully complex hidden nodes , 2008, Neurocomputing.

[7]  Meng Joo Er,et al.  An Enhanced Online Sequential Extreme Learning Machine algorithm , 2008, 2008 Chinese Control and Decision Conference.

[8]  Narasimhan Sundararajan,et al.  A generalized growing and pruning RBF (GGAP-RBF) neural network for function approximation , 2005, IEEE Transactions on Neural Networks.

[9]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[10]  Yonggwan Won,et al.  A Robust Online Sequential Extreme Learning Machine , 2007, ISNN.

[11]  Qiang Ji,et al.  Knowledge Based Activity Recognition with Dynamic Bayesian Network , 2010, ECCV.

[12]  Mohamad Ivan Fanany,et al.  Pose-based 3D human motion analysis using Extreme Learning Machine , 2013, 2013 IEEE 2nd Global Conference on Consumer Electronics (GCCE).

[13]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[14]  Hsuan-Sheng Chen,et al.  Human action recognition using star skeleton , 2006, VSSN '06.

[15]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[16]  Markus Koskela,et al.  Online RGB-D gesture recognition with extreme learning machines , 2013, ICMI '13.

[17]  Y Lu,et al.  A Sequential Learning Scheme for Function Approximation Using Minimal Radial Basis Function Neural Networks , 1997, Neural Computation.

[18]  Sergio Escalera,et al.  Multi-modal gesture recognition challenge 2013: dataset and results , 2013, ICMI '13.

[19]  Ling Shao,et al.  Human action recognition based on boosted feature selection and naive Bayes nearest-neighbor classification , 2013, Signal Process..

[20]  Christophe De Vleeschouwer,et al.  Visual event recognition using decision trees , 2010, Multimedia Tools and Applications.

[21]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[22]  Narasimhan Sundararajan,et al.  A sequential multi-category classifier using radial basis function networks , 2008, Neurocomputing.

[23]  Paramasivan Saratchandran,et al.  Performance evaluation of a sequential minimal radial basis function (RBF) neural network learning algorithm , 1998, IEEE Trans. Neural Networks.

[24]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.