Fusion of Single View Soft k-NN Classifiers for Multicamera Human Action Recognition

This paper presents two different classifier fusion algorithms applied in the domain of Human Action Recognition from video A set of cameras observes a person performing an action from a predefined set For each camera view a 2D descriptor is computed and a posterior on the performed activity is obtained using a soft classifier These posteriors are combined using voting and a bayesian network to obtain a single belief measure to use for the final decision on the performed action Experiments are conducted with different low level frame descriptors on the IXMAS dataset, achieving results comparable to state of the art 3D proposals, but only performing 2D processing.

[1]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[2]  Daniel Boley,et al.  Human motion recognition using support vector machines , 2009, Comput. Vis. Image Underst..

[3]  Miguel A. Patricio,et al.  Non-supervised discovering of user activities in visual sensor networks for ambient intelligence applications , 2009, 2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies.

[4]  Andrew J. Davison,et al.  Active Matching , 2008, ECCV.

[5]  Ian D. Reid,et al.  A general method for human activity recognition in video , 2006, Comput. Vis. Image Underst..

[6]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories , 2006 .

[7]  Yiannis Aloimonos,et al.  A Language for Human Action , 2007, Computer.

[8]  Ehud Rivlin,et al.  Understanding Video Events: A Survey of Methods for Automatic Interpretation of Semantic Occurrences in Video , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[10]  Cordelia Schmid,et al.  Actions in context , 2009, CVPR.

[11]  Du Tran,et al.  Human Activity Recognition with Metric Learning , 2008, ECCV.

[12]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Avinash C. Kak,et al.  Distributed and lightweight multi-camera human activity classification , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[14]  Trevor Darrell,et al.  Hidden Conditional Random Fields , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[16]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Trevor Darrell,et al.  Hidden-state Conditional Random Fields , 2006 .

[18]  Pedro Ribeiro,et al.  Human Activity Recognition from Video: modeling, feature selection and classification architecture , 2005 .

[19]  Rémi Ronfard,et al.  Free viewpoint action recognition using motion history volumes , 2006, Comput. Vis. Image Underst..

[20]  R. Cucchiara,et al.  Making the home safer and more secure through visual surveillance , 2005 .

[21]  Gang Qian,et al.  View-invariant full-body gesture recognition via multilinear analysis of voxel data , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).