Independent increment processes for human motion recognition

This paper describes an algorithm for classifying human motion patterns (trajectories) observed in video sequences. We address this task in a hierarchical way: high-level activities are described as sequences of low-level motion patterns (dynamic models). These low-level dynamic models are simply independent increment processes, each describing a specific motion regime (e.g., ''moving left''). Classifying a trajectory thus consists in segmenting it into the sequence its low-level components; each sequence of low-level components corresponds to a high-level activity. To perform the segmentation, we introduce a penalized maximum-likelihood criterion which is able to select the number of segments via a novel MDL-type penalty. Experiments with synthetic and real data illustrate the effectiveness of the proposed approach.

[1]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[3]  Jitendra Malik,et al.  Traffic Surveillance And Detection Technology Development: New Traffic Sensor Technology Final Report , 1997 .

[4]  David C. Hogg,et al.  Learning Deformable Models for Tracking the Human Body , 1997 .

[5]  Jorge S. Marques,et al.  Robust shape tracking in the presence of cluttered background , 2000, IEEE Transactions on Multimedia.

[6]  T. Mexia,et al.  Author ' s personal copy , 2009 .

[7]  Larry S. Davis,et al.  Human expression recognition from motion using a radial basis function network architecture , 1996, IEEE Trans. Neural Networks.

[8]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  David C. Hogg,et al.  Learning the Distribution of Object Trajectories for Event Recognition , 1995, BMVC.

[10]  Yvan G. Leclerc,et al.  Constructing simple stable descriptions for image partitioning , 1989, International Journal of Computer Vision.

[11]  Mubarak Shah,et al.  Monitoring human behavior from video taken in an office environment , 2001, Image Vis. Comput..

[12]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Tapas Kanungo,et al.  A fast algorithm for MDL-based multi-band image segmentation , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[14]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Mubarak Shah,et al.  Visual gesture recognition , 1994 .

[16]  Pedro Ribeiro,et al.  Human Activity Recognition from Video: modeling, feature selection and classification architecture , 2005 .

[17]  Michael Isard,et al.  A mixed-state condensation tracker with automatic model-switching , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[18]  Osama Masoud,et al.  A method for human action recognition , 2003, Image Vis. Comput..

[19]  Jorge S. Marques,et al.  Optimal and suboptimal shape tracking based on multiple switched dynamic models , 2001, Image Vis. Comput..

[20]  Mubarak Shah,et al.  Motion-Based Recognition , 1997, Computational Imaging and Vision.

[21]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Henry A. Kautz,et al.  Learning and inferring transportation routines , 2004, Artif. Intell..

[23]  Mário A. T. Figueiredo,et al.  Recognition of human activities using space dependent switched dynamical models , 2005, IEEE International Conference on Image Processing 2005.

[24]  Michael J. Black,et al.  Parameterized Modeling and Recognition of Activities , 1999, Comput. Vis. Image Underst..

[25]  Ramakant Nevatia,et al.  Multi-agent event recognition , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.