Paper information - Human action recognition using deep rule-based classifier

Human action recognition using deep rule-based classifier

In recent years, numerous techniques have been proposed for human activity recognition (HAR) from images and videos. These techniques can be divided into two major categories: handcrafted and deep learning-based. Deep learning-based models have produced remarkable results for HAR. However, these models have several shortcomings, such as the requirement for a massive amount of training data, lack of transparency, offline nature, and poor interpretability of their internal parameters. In this paper, a new approach for HAR is proposed, which consists of an interpretable, self-evolving, and self-organizing set of 0-order IF...THEN rules. This approach is entirely data-driven and non-parametric; thus, prototypes are identified automatically during the training process. To demonstrate the effectiveness of the proposed method, a set of high-level features is obtained using a pre-trained deep convolutional neural network model, and a recently introduced deep rule-based (DRB) classifier is applied for classification. Experiments are performed on the challenging benchmark dataset UCF50; the results confirm that the proposed approach outperforms state-of-the-art methods. In addition, an ablation study is conducted to demonstrate the efficacy of the proposed approach by comparing the performance of the DRB classifier with four state-of-the-art classifiers. This analysis revealed that the DRB classifier could perform better than these classifiers, even with limited training samples.
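
To make the idea of prototype-based 0-order IF...THEN rules more concrete, the following is a minimal sketch, not the DRB algorithm as published. It assumes that feature vectors have already been extracted by some pre-trained network (random vectors stand in for them here), and it replaces the DRB prototype-identification procedure with a plain distance threshold. The class name `PrototypeRuleClassifier`, the `radius` parameter, and the action labels are hypothetical and chosen only for illustration.

```python
import numpy as np


class PrototypeRuleClassifier:
    """Simplified 0-order rule base with one IF...THEN rule per class.

    Each rule reads: IF (x ~ p_1) OR (x ~ p_2) OR ... THEN class c,
    where the p_i are prototypes picked from the training data.
    """

    def __init__(self, radius=0.8):
        # Hypothetical distance threshold; an assumption, not a value from the paper.
        self.radius = radius
        self.prototypes = {}  # class label -> list of prototype vectors
        self.support = {}     # class label -> number of samples behind each prototype

    @staticmethod
    def _normalize(x):
        # Scale the feature vector to unit length.
        x = np.asarray(x, dtype=float)
        return x / (np.linalg.norm(x) + 1e-12)

    def partial_fit(self, x, label):
        """Process one labelled sample; the rule for its class grows or adapts."""
        x = self._normalize(x)
        protos = self.prototypes.setdefault(label, [])
        counts = self.support.setdefault(label, [])
        if not protos:
            # The first sample of a class becomes its first prototype.
            protos.append(x)
            counts.append(1)
            return
        dists = [np.linalg.norm(x - p) for p in protos]
        nearest = int(np.argmin(dists))
        if dists[nearest] > self.radius:
            # Far from every existing prototype: add a new one to the rule.
            protos.append(x)
            counts.append(1)
        else:
            # Close to an existing prototype: move it towards the sample
            # using a running mean.
            counts[nearest] += 1
            protos[nearest] = protos[nearest] + (x - protos[nearest]) / counts[nearest]

    def predict(self, x):
        """Winner-takes-all over the per-class rule firing strengths."""
        x = self._normalize(x)
        scores = {
            label: max(np.exp(-np.linalg.norm(x - p) ** 2) for p in protos)
            for label, protos in self.prototypes.items()
        }
        return max(scores, key=scores.get)


# Synthetic stand-ins for high-level CNN features (illustrative data only).
rng = np.random.default_rng(0)
center_a = np.zeros(16)
center_a[0] = 1.0
center_b = np.zeros(16)
center_b[1] = 1.0

clf = PrototypeRuleClassifier(radius=0.8)
for _ in range(20):
    clf.partial_fit(center_a + 0.05 * rng.standard_normal(16), "biking")
    clf.partial_fit(center_b + 0.05 * rng.standard_normal(16), "diving")

pred_a = clf.predict(center_a)
pred_b = clf.predict(center_b)
print(pred_a, pred_b)
```

Because each rule is just a list of prototypes taken from the data, the reason for a decision can be inspected by looking up the prototype that produced the highest score. Samples are processed one at a time, so the rule base can keep growing as new data arrives.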


[36] Muhammad Haroon Yousaf,et al. A Bag of Expression framework for improved human action recognition , 2018, Pattern Recognit. Lett..

[37] Cordelia Schmid,et al. Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[38] Serge J. Belongie,et al. Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[39] Ivor W. Tsang,et al. DEFEATnet—A Deep Conventional Image Representation for Image Classification , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[40] Plamen P. Angelov,et al. Deep rule-based classifier with human-level performance and characteristics , 2018, Inf. Sci..

[41] Andrew Zisserman,et al. Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42] WangLimin,et al. Bag of visual words and fusion methods for action recognition , 2016 .

[43] Ming Yang,et al. DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[44] Jyh-Yeong Chang,et al. Fuzzy rule inference based human activity recognition , 2009, 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC).

[45] Hani Hagras,et al. A fuzzy logic-based system for the automation of human behavior recognition using machine vision in intelligent environments , 2015, Soft Comput..

[46] Cordelia Schmid,et al. Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.

[47] Jim Tørresen,et al. A Robust Human Activity Recognition Approach Using OpenPose, Motion Features, and Deep Recurrent Neural Network , 2019, SCIA.

[48] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[49] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.

[50] Hanli Wang,et al. Motion keypoint trajectory and covariance descriptor for human action recognition , 2018, The Visual Computer.

[51] Feiping Nie,et al. Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking , 2020, IEEE Transactions on Multimedia.

[52] Plamen P. Angelov,et al. Semi-supervised deep rule-based approach for image classification , 2018, Appl. Soft Comput..

[53] Plamen P. Angelov,et al. A Massively Parallel Deep Rule-Based Ensemble Classifier for Remote Sensing Scenes , 2018, IEEE Geoscience and Remote Sensing Letters.

[54] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[55] AizawaKiyoharu,et al. Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information , 2017 .

[56] Yann LeCun,et al. Convolutional Learning of Spatio-temporal Features , 2010, ECCV.

[57] Junwei Han,et al. Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58] Rama Chellappa,et al. Cross-View Action Recognition via Transferable Dictionary Learning , 2016, IEEE Transactions on Image Processing.

[59] S. Solberg,et al. Mapping patterns of abiotic and biotic stress resilience uncovers conservation gaps and breeding potential of Vigna wild relatives , 2020, Scientific Reports.

[60] Ronen Basri,et al. Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[61] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[62] Mubarak Shah,et al. Recognizing 50 human action categories of web videos , 2012, Machine Vision and Applications.

[63] Andrea Vedaldi,et al. Dynamic Image Networks for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64] Plamen P. Angelov,et al. A new type of simplified fuzzy rule-based system , 2012, Int. J. Gen. Syst..

[65] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.

[66] Mubarak Shah,et al. Learning a Deep Model for Human Action Recognition from Novel Viewpoints , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[67] Plamen Angelov,et al. A Comprehensive Review on Handcrafted and Learning-Based Action Representation Approaches for Human Activity Recognition , 2017 .

[68] Theo Gevers,et al. Evaluation of Color STIPs for Human Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[69] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[70] Cordelia Schmid,et al. A Robust and Efficient Video Representation for Action Recognition , 2015, International Journal of Computer Vision.

[71] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[72] Zhi-Qiang Liu,et al. Type-2 Fuzzy Topic Models for Human Action Recognition , 2015, IEEE Transactions on Fuzzy Systems.

[73] Dong Xu,et al. Advanced Deep-Learning Techniques for Salient and Category-Specific Object Detection: A Survey , 2018, IEEE Signal Processing Magazine.

[74] Nicu Sebe,et al. Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information , 2017, Multimedia Tools and Applications.

[75] Bernadette Dorizzi,et al. Human activities of daily living recognition using fuzzy logic for elderly home monitoring , 2009, 2009 IEEE International Conference on Fuzzy Systems.