Analysis of Human Actions for Video Indexing

Automatically understanding human actions is crutial for efficiently indexing many types of videos, such as sports videos, home videos, movies etc. However, it is challenging due to their variances caused by different actors, different scales, and different views. In order to incorporate these variances, most methods in literature have to sacrifice the discriminability of action models. In this paper, we address the tradeoff between invariability and discriminability. We firstly propose a novel set of pixel-wise features which are invariant to actor appearances, scales, and motion directions. Then, multi-prototype action models are constructed to realize view invariance. By leaving the most challenging invariance from feature level to model level, we successfully maintain the discriminability of action models. The extensive experiments demonstrated the good performance of the proposed method.

[1]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[2]  Jianbo Shi,et al.  Detecting unusual activity in video , 2004, CVPR 2004.

[3]  Gary R. Bradski,et al.  Motion segmentation and pose recognition with motion history gradients , 2000, Proceedings Fifth IEEE Workshop on Applications of Computer Vision.

[4]  Mubarak Shah,et al.  Actions sketch: a novel action representation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[6]  Shaogang Gong,et al.  Video behaviour profiling and abnormality detection without manual labelling , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Shaogang Gong,et al.  Beyond Tracking: Modelling Activity and Understanding Behaviour , 2006, International Journal of Computer Vision.

[8]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Tim J. Ellis,et al.  Spatial and Probabilistic Modelling of Pedestrian Behaviour , 2002, BMVC.

[11]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..