Human action recognition using deep rule-based classifier

In recent years, numerous techniques have been proposed for human activity recognition (HAR) from images and videos. These techniques can be divided into two major categories: handcrafted and deep learning. Deep Learning-based models have produced remarkable results for HAR. However, these models have several shortcomings, such as the requirement for a massive amount of training data, lack of transparency, offline nature, and poor interpretability of their internal parameters. In this paper, a new approach for HAR is proposed, which consists of an interpretable, self-evolving, and self-organizing set of 0-order If...THEN rules. This approach is entirely data-driven, and non-parametric; thus, prototypes are identified automatically during the training process. To demonstrate the effectiveness of the proposed method, a set of high-level features is obtained using a pre-trained deep convolution neural network model, and a recently introduced deep rule-based classifier is applied for classification. Experiments are performed on a challenging benchmark dataset UCF50; results confirmed that the proposed approach outperforms state-of-the-art methods. In addition to this, an ablation study is conducted to demonstrate the efficacy of the proposed approach by comparing the performance of our DRB classifier with four state-of-the-art classifiers. This analysis revealed that the DRB classifier could perform better than state-of-the-art classifiers, even with limited training samples.

[1]  Kang Ryoung Park,et al.  Fuzzy system based human behavior recognition by combining behavior prediction and recognition , 2017, Expert Syst. Appl..

[2]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Plamen P. Angelov,et al.  Autonomous learning multi-model classifier of 0-Order (ALMMo-0) , 2017, 2017 Evolving and Adaptive Intelligent Systems (EAIS).

[4]  Mohan S. Kankanhalli,et al.  Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Andrew Zisserman,et al.  Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[7]  Zhe Wang,et al.  Towards Good Practices for Very Deep Two-Stream ConvNets , 2015, ArXiv.

[8]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Plamen Angelov,et al.  Autonomous Learning Systems: From Data Streams to Knowledge in Real-time , 2013 .

[10]  Bhiksha Raj,et al.  Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Xiaofeng Wang,et al.  Human action recognition using transfer learning with deep representations , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[12]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Jan Kautz,et al.  SENSE: A Shared Encoder Network for Scene-Flow Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[14]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Jinhui Tang,et al.  Weakly Supervised Deep Matrix Factorization for Social Image Understanding , 2017, IEEE Transactions on Image Processing.

[16]  Tiejun Huang,et al.  P-ODN: Prototype-based Open Deep Network for Open Set Recognition , 2019, Scientific Reports.

[17]  M. Mildner,et al.  Re-epithelialization and immune cell behaviour in an ex vivo human skin model , 2020, Scientific Reports.

[18]  Xiaowei Gu,et al.  A cascade of deep learning fuzzy rule-based image classifier and SVM , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[19]  Tahir Cetin Akinci,et al.  Evaluation of student performance in laboratory applications using fuzzy logic , 2010 .

[20]  C. Krishna Mohan,et al.  Coherent and Noncoherent Dictionaries for Action Recognition , 2017, IEEE Signal Processing Letters.

[21]  Andrea Vedaldi,et al.  Transactions on Pattern Analysis and Machine Intelligence 1 Action Recognition with Dynamic Image Networks , 2022 .

[22]  Steven C. H. Hoi,et al.  Face Detection using Deep Learning: An Improved Faster RCNN Approach , 2017, Neurocomputing.

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Chi Zhang,et al.  What Elements are Essential to Recognize Human Actions? , 2019, CVPR Workshops.

[25]  Sung Wook Baik,et al.  Action recognition using optimized deep autoencoder and CNN for surveillance data streams of non-stationary environments , 2019, Future Gener. Comput. Syst..

[26]  Plamen Angelov,et al.  Human Action Recognition from Multiple Views Based on View-Invariant Feature Descriptor Using Support Vector Machines , 2016 .

[27]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[28]  Amir Roshan Zamir,et al.  Action Recognition in Realistic Sports Videos , 2014 .

[29]  Yi Yang,et al.  Semi-Supervised Multiple Feature Analysis for Action Recognition , 2014, IEEE Transactions on Multimedia.

[30]  Limin Wang,et al.  Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice , 2014, Comput. Vis. Image Underst..

[31]  Cordelia Schmid,et al.  Long-Term Temporal Convolutions for Action Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Chong Wang,et al.  Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[34]  P. Angelov,et al.  Empirical Approach to Machine Learning , 2018, Studies in Computational Intelligence.

[35]  Y-Lan Boureau,et al.  Learning Convolutional Feature Hierarchies for Visual Recognition , 2010, NIPS.

[36]  Muhammad Haroon Yousaf,et al.  A Bag of Expression framework for improved human action recognition , 2018, Pattern Recognit. Lett..

[37]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[38]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[39]  Ivor W. Tsang,et al.  DEFEATnet—A Deep Conventional Image Representation for Image Classification , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  Plamen P. Angelov,et al.  Deep rule-based classifier with human-level performance and characteristics , 2018, Inf. Sci..

[41]  Andrew Zisserman,et al.  Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  WangLimin,et al.  Bag of visual words and fusion methods for action recognition , 2016 .

[43]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Jyh-Yeong Chang,et al.  Fuzzy rule inference based human activity recognition , 2009, 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC).

[45]  Hani Hagras,et al.  A fuzzy logic-based system for the automation of human behavior recognition using machine vision in intelligent environments , 2015, Soft Comput..

[46]  Cordelia Schmid,et al.  Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.

[47]  Jim Tørresen,et al.  A Robust Human Activity Recognition Approach Using OpenPose, Motion Features, and Deep Recurrent Neural Network , 2019, SCIA.

[48]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[49]  Thomas Serre,et al.  HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.

[50]  Hanli Wang,et al.  Motion keypoint trajectory and covariance descriptor for human action recognition , 2018, The Visual Computer.

[51]  Feiping Nie,et al.  Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking , 2020, IEEE Transactions on Multimedia.

[52]  Plamen P. Angelov,et al.  Semi-supervised deep rule-based approach for image classification , 2018, Appl. Soft Comput..

[53]  Plamen P. Angelov,et al.  A Massively Parallel Deep Rule-Based Ensemble Classifier for Remote Sensing Scenes , 2018, IEEE Geoscience and Remote Sensing Letters.

[54]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[55]  AizawaKiyoharu,et al.  Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information , 2017 .

[56]  Yann LeCun,et al.  Convolutional Learning of Spatio-temporal Features , 2010, ECCV.

[57]  Junwei Han,et al.  Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Rama Chellappa,et al.  Cross-View Action Recognition via Transferable Dictionary Learning , 2016, IEEE Transactions on Image Processing.

[59]  S. Solberg,et al.  Mapping patterns of abiotic and biotic stress resilience uncovers conservation gaps and breeding potential of Vigna wild relatives , 2020, Scientific Reports.

[60]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[61]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  Mubarak Shah,et al.  Recognizing 50 human action categories of web videos , 2012, Machine Vision and Applications.

[63]  Andrea Vedaldi,et al.  Dynamic Image Networks for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64]  Plamen P. Angelov,et al.  A new type of simplified fuzzy rule-based system , 2012, Int. J. Gen. Syst..

[65]  Cordelia Schmid,et al.  Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.

[66]  Mubarak Shah,et al.  Learning a Deep Model for Human Action Recognition from Novel Viewpoints , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[67]  Plamen Angelov,et al.  A Comprehensive Review on Handcrafted and Learning-Based Action Representation Approaches for Human Activity Recognition , 2017 .

[68]  Theo Gevers,et al.  Evaluation of Color STIPs for Human Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[69]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[70]  Cordelia Schmid,et al.  A Robust and Efficient Video Representation for Action Recognition , 2015, International Journal of Computer Vision.

[71]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[72]  Zhi-Qiang Liu,et al.  Type-2 Fuzzy Topic Models for Human Action Recognition , 2015, IEEE Transactions on Fuzzy Systems.

[73]  Dong Xu,et al.  Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection: A Survey , 2018, IEEE Signal Processing Magazine.

[74]  Nicu Sebe,et al.  Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information , 2017, Multimedia Tools and Applications.

[75]  Bernadette Dorizzi,et al.  Human activities of daily living recognition using fuzzy logic for elderly home monitoring , 2009, 2009 IEEE International Conference on Fuzzy Systems.