Action identification using a descriptor with autonomous fragments in a multilevel prediction scheme

Recent technological advances have provided powerful devices with high processing and storage capabilities. Video cameras are now found in many settings, such as banks, schools, stores, public streets and industrial sites, where they serve a variety of tasks. Camera technology has steadily improved, reaching higher resolutions and acquisition frame rates. Nevertheless, most video analysis is still performed by human operators, whose performance may be affected by fatigue and stress. To address this problem, this work proposes and evaluates a method for action identification in videos through a new descriptor composed of autonomous fragments applied to a multilevel prediction scheme. The method is very fast and achieves over 90% accuracy on well-known public data sets. The developed system enables real-time action analysis with current video cameras, making it a useful and powerful tool for surveillance purposes.
