Representation and Recognition of Human Actions - a New Approach based on an Optimal Control Motor Model

We present a novel approach to the problem of representation and recognition of human actions, that uses an optimal control based model to connect the high-level goals of a human subject to the low-level movement trajectories captured by a computer vision system. These models quantify the high-level goals as a performance criterion or cost function which the human sensorimotor system optimizes by picking the control strategy that achieves the best possible performance. We show that the human body can be modeled as a hybrid linear system that can operate in one of several possible modes, where each mode corresponds to a particular highlevel goal or cost function. The problem of action recognition, then is to infer the current mode of the system from observations of the movement trajectory. We demonstrate our approach on 3D visual data of human arm

[1]  Michael K. Pitt,et al.  Auxiliary Variable Based Particle Filters , 2001, Sequential Monte Carlo Methods in Practice.

[2]  George W. Irwin,et al.  Multiple model bootstrap filter for maneuvering target tracking , 2000, IEEE Trans. Aerosp. Electron. Syst..

[3]  E. Todorov Optimality principles in sensorimotor control , 2004, Nature Neuroscience.

[4]  Christoph Bregler,et al.  Learning and recognizing human dynamics in video sequences , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Jake K. Aggarwal,et al.  A hierarchical Bayesian network for event recognition of human actions and interactions , 2004, Multimedia Systems.

[6]  Jessica K. Hodgins,et al.  Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces , 2004, ACM Trans. Graph..

[7]  N. de Freitas Rao-Blackwellised particle filtering for fault diagnosis , 2002, Proceedings, IEEE Aerospace Conference.

[8]  R.M. Murray,et al.  Segmentation of human motion into dynamics based primitives with application to drawing tasks , 2003, Proceedings of the 2003 American Control Conference, 2003..

[9]  Emanuel Todorov,et al.  Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems , 2004, ICINCO.

[10]  Daniel M. Wolpert,et al.  Signal-dependent noise determines motor planning , 1998, Nature.

[11]  S. Scott Optimal feedback control and the neural basis of volitional motor control , 2004, Nature Reviews Neuroscience.

[12]  N. D. Freitas Rao-Blackwellised particle filtering for fault diagnosis , 2002 .

[13]  Ruggero Frezza,et al.  Control of a Manipulator with a Minimum Number of Motion Primitives , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[14]  Rémi Ronfard,et al.  Automatic Discovery of Action Taxonomies from Multiple Views , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Eric Horvitz,et al.  Layered representations for learning and inferring office activity from multiple sensory channels , 2004, Comput. Vis. Image Underst..

[16]  Maja J. Mataric,et al.  Automated Derivation of Primitives for Movement Classification , 2000, Auton. Robots.

[17]  Y. Bar-Shalom,et al.  The interacting multiple model algorithm for systems with Markovian switching coefficients , 1988 .