A recent survey for human activity recoginition based on deep learning approach

Human activity recognition is an active research topic in computer vision due to its applicability to wide range of application areas such as smart surveillance, robot learning, human computer interaction, health assessment. Identifying human activities from video sequences constitutes one of the most challenging tasks in the field of computer vision, especially due to the harsh nature of the real-world activity recognition scenarios and high volumes of data that need to be worked upon. The techniques available in the literature for activity recognition are broadly classified into two categories: single layered approaches aimed at recognition of much simpler activities, hierarchical approaches aimed at recognition of complex activities in terms of simpler ones. Deep learning is one of the hierarchical approaches for recognizing human activities capable of achieving outstanding results and outperforming other "non-deep" state-of-the-art methods by effectively utilizing the image structure in reducing the search space of the learning model. This paper aims at capturing a snapshot of current trends in activity recognition with deep learning models. We have also examined the merits, demerits, efficiency of pioneering deep learning models being used for activity recognition.

[1]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[2]  Andrew Zisserman,et al.  Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[3]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[4]  Chalavadi Krishna Mohan,et al.  Human action recognition using genetic algorithms and convolutional neural networks , 2016, Pattern Recognit..

[5]  Md. Zia Uddin,et al.  A Depth Camera-based Human Activity Recognition via Deep Learning Recurrent Neural Network for Health and Social Care Services , 2016, CENTERIS/ProjMAN/HCist.

[6]  Kate Saenko,et al.  R-C3D: Region Convolutional 3D Network for Temporal Activity Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[8]  Yingdong Ma,et al.  Convolutional neural networks (CNN) for indoor human activity recognition using Ubisense system , 2017, 2017 29th Chinese Control And Decision Conference (CCDC).

[9]  Christian Wolf,et al.  Sequential Deep Learning for Human Action Recognition , 2011, HBU.

[10]  Adhavan Jayabalan,et al.  Dynamic Action Recognition: A convolutional neural network model for temporally organized joint location data , 2016, ArXiv.

[11]  Li Fei-Fei,et al.  Unsupervised Learning of Long-Term Motion Dynamics for Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Lingfei Mo,et al.  Human physical activity recognition based on computer vision with deep learning model , 2016, 2016 IEEE International Instrumentation and Measurement Technology Conference Proceedings.

[13]  Jing Zhang,et al.  Action Recognition From Depth Maps Using Deep Convolutional Neural Networks , 2016, IEEE Transactions on Human-Machine Systems.

[14]  Gabriel Thomas,et al.  Human Activity Recognition using Binary Motion Image and Deep Learning , 2015, Procedia Computer Science.

[15]  Roger Leitzke Granada,et al.  A Deep Neural Architecture for Kitchen Activity Recognition , 2017, FLAIRS Conference.

[16]  Zhi Liu,et al.  3D-based Deep Convolutional Neural Network for action recognition with depth sequences , 2016, Image Vis. Comput..

[17]  Andrew Zisserman,et al.  Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[19]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Cordelia Schmid,et al.  Long-Term Temporal Convolutions for Action Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Michael S. Ryoo,et al.  Video-based convolutional neural networks for activity recognition from robot-centric videos , 2016, SPIE Defense + Security.