Action Recognition by Motion Trajectory Decomposition

Recognition of human actions in a video acquired by a moving camera typically requires standard preprocessing steps such as motion compensation, moving object detection and object tracking. The errors from the motion compensation step propagate to the object detection stage, resulting in miss-detections, which further complicates the tracking stage, resulting in cluttered and incorrect tracks. Therefore, action recognition from a moving camera is considered very challenging. In this chapter, we discuss an approach which does not follow the standard steps, and accordingly avoids the aforementioned difculties. The approach is based on Lagrangian particle trajectories which are a set of dense trajectories obtained by advecting optical flow over time, thus capturing the ensemble motions of a scene. This is done in frames of unaligned video, and no object detection is required. Consequently, in order to handle the moving camera, a low-rank trajectory decomposition approach is employed, where the trajectories are decomposed into their camera-induced and object-induced components. Having obtained the relevant object motion trajectories, a compact set of chaotic invariant features are computed, which captures the characteristics of the trajectories. Finally, a SVM is employed to learn and recognize the human actions using the computed motion features.

[1]  Mubarak Shah,et al.  View-Invariant Representation and Recognition of Actions , 2002, International Journal of Computer Vision.

[2]  Martial Hebert,et al.  Trajectons: Action recognition through the motion analysis of tracked features , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[3]  Ce Liu,et al.  Exploring new representations and applications for motion analysis , 2009 .

[4]  Rangachar Kasturi,et al.  Activity recognition based on multiple motion trajectories , 2004, ICPR 2004.

[5]  Heinrich Niemann,et al.  Gait Classification with HMMs for Trajectories of Body Parts Extracted by Mixture Densities , 1998, BMVC.

[6]  Hui Cheng,et al.  Geo-spatial aerial video processing for scene understanding and object tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Cordelia Schmid,et al.  Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.

[8]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[9]  Mubarak Shah,et al.  Chaotic invariants of Lagrangian particle trajectories for anomaly detection in crowded scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  François Brémond,et al.  Gesture recognition by learning local motion signatures , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Shandong Wu,et al.  Flexible signature descriptions for adaptive motion trajectory representation, perception and recognition , 2009, Pattern Recognit..

[12]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[13]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[14]  Mubarak Shah,et al.  Chaotic Invariants for Human Action Recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[15]  Harpreet S. Sawhney,et al.  Vehicle detection and tracking in wide field-of-view aerial video , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  A. G. Amitha Perera,et al.  Joint Recognition of Complex Events and Track Matching , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[18]  Adriana Kovashka,et al.  Learning a hierarchy of discriminative space-time neighborhood features for human action recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.