New paradigm for recognition of aggressive human behavior based on bag-of-features and skeleton graph

The human action recognition is an active field on computer vision in the last decade. It consists to automatically identify of human behavior and interpreting their actions. In this paper, we propose a new paradigm to recognize aggressive human behavior based on two models. The first method is based on shape representation by using bag-of-features approach and the second method is based on the skeleton graph in order to extract motion features. The feature association of the two models is carried out at each frame of atomic action. Thus, an appropriate label is assigned to each feature association vector by using an offline clustering algorithm such as k-means. The obtained feature vectors are conducted from a sequence video by using a set of labels as an optimum codebook. The aggressive behaviors are then recognized by applying a support vector machine classifier. The proposed algorithm enables robust recognition in very challenging situations such as dynamic environment and deals well with self-occlusion problem. Experimental results are conducted on KTH dataset actions and demonstrate that the proposed approach provide significant recognition rate of 96%.

[1]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  François Brémond,et al.  Video-understanding framework for automatic behavior recognition , 2006, Behavior research methods.

[4]  Tae-Kyun Kim,et al.  Canonical Correlation Analysis of Video Volume Tensors for Action Categorization and Detection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Larry S. Davis,et al.  Recognizing actions by shape-motion prototype trees , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[7]  Janusz Konrad,et al.  CHAPTER 3 – Motion Detection and Estimation , 2009 .

[8]  Herbert Freeman,et al.  On the Encoding of Arbitrary Geometric Configurations , 1961, IRE Trans. Electron. Comput..

[9]  Koichiro Akita,et al.  Image sequence analysis of real world human motion , 1984, Pattern Recognit..

[10]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[12]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Li Yao,et al.  A distance map based skeletonization algorithm and its application in fiber recognition , 2008, 2008 International Conference on Audio, Language and Image Processing.

[15]  Luciano da Fontoura Costa,et al.  On Voronoi Diagrams and Medial Axes , 2002, Journal of Mathematical Imaging and Vision.

[16]  Alberto Del Bimbo,et al.  Effective Codebooks for human action categorization , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[17]  Vittorio Murino,et al.  Background Subtraction for Automated Multisensor Surveillance: A Comprehensive Review , 2010, EURASIP J. Adv. Signal Process..

[18]  Janusz Konrad,et al.  3.10 – Motion Detection and Estimation , 2005 .

[19]  Yaser Sheikh,et al.  Matching Trajectories of Anatomical Landmarks Under Viewpoint, Anthropometric and Temporal Transforms , 2009, International Journal of Computer Vision.

[20]  Liang-Tien Chia,et al.  Motion Context: A New Representation for Human Action Recognition , 2008, ECCV.

[21]  Adriana Kovashka,et al.  Learning a hierarchy of discriminative space-time neighborhood features for human action recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, ICPR 2004.

[23]  Adam Krzyzak,et al.  Piecewise Linear Skeletonization Using Principal Curves , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Urs Ramer,et al.  An iterative procedure for the polygonal approximation of plane curves , 1972, Comput. Graph. Image Process..

[25]  Hatice Gunes,et al.  Bi-modal emotion recognition from expressive face and body gestures , 2007, J. Netw. Comput. Appl..

[26]  Sukmoon Chang Extracting Skeletons from Distance Maps , 2007 .