On the use of anthropometry in the invariant analysis of human actions

We propose a novel approach to matching human actions using semantic correspondence between human bodies with an eye towards invariant analysis of activity. The correspondences are used to provide geometric constraints between multiple anatomical landmarks (e.g. hands, shoulders and feet) to match actions performed from different viewpoints and in different environments. The fact that the human body has certain anthropometric proportion allows innovative use of the machinery of epipolar geometry to provide constraints to accurately analyze actions performed by different people leading to some interesting results. Temporally invariant matching is performed, using non-linear time warping, to ensure that similar actions performed at different rates are accurately matched as well. Thus, the proposed algorithm guarantees that both temporal and view invariance is maintained in matching. We demonstrate the versatility of our algorithm in a number of challenging sequences and applications.

[1]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[2]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[3]  Robert W. Bailey,et al.  Human performance engineering: A guide for system designers , 1982 .

[4]  Karl H.E. Kroemer,et al.  Anthropometry and biomechanics : theory and application , 1982 .

[5]  Karl H.E. Kroemer,et al.  Anthropometry and Biomechanics , 1982 .

[6]  Andrew Zisserman,et al.  Geometric invariance in computer vision , 1992 .

[7]  Stan Sclaroff,et al.  Trajectory Guided Tracking and Recognition of Actions , 1999 .

[8]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[9]  Rómer Rosales,et al.  Inferring body pose without tracking body parts , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[10]  M. Irani,et al.  Event-Based Video Analysis, , 2001 .

[11]  Lihi Zelnik-Manor,et al.  Event-based analysis of video , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  Rama Chellappa,et al.  View invariants for human action recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[14]  Tanveer F. Syeda-Mahmood,et al.  View-invariant alignment and matching of video sequences , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Steven M. Seitz,et al.  View-Invariant Analysis of Cyclic Motion , 1997, International Journal of Computer Vision.

[16]  Robert Bridger Introduction to Ergonomics , 2008 .