A realtime shrug detector

A realtime system for shrug detection is discussed in this paper. The system is automatically initialized by a face detector based on Ada-boost [P. Viola and M. Jones, May 2004]. After frontal face is localized by the face detector, shoulder position is detected by fitting a parabola to the nearby horizontal edges using weighted Hough transform [K. Sugawara, 1997]. Since shrug is an action which is defined not only by the distance between face and shoulder but also the relative temporal-spatial changing between them, we propose a parameterizing scheme using two different parabolas, named as "stable parabola" (SP) and "transient parabola" (TP) to characterize the action shrug. Stable parabola represents the mean shoulder position over a long time duration, while transient parabola represents the mean shoulder position of a very short time duration. By using this scheme (only 6 dimensions), we avoid the high dimensional representation of the temporal process-shrug, and therefore make the realtime implementation possible. The shrug detector is then trained in the parameter space using Fisher discriminant analysis (FDA). The experiments show that the proposed shrug detector is able to not only detect the shrug action correctly and efficiently (in realtime), but also tolerate the large in-class variation caused by different subject, different action speed, illumination, partial occlusion, and background clutter. So the proposed realtime shrug detector is promising in video analysis under an uncontrolled environment

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Cristian Sminchisescu,et al.  Covariance scaled sampling for monocular 3D body tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[4]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[5]  Michael I. Mandel,et al.  Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation , 2004, NIPS.

[6]  Daniel P. Huttenlocher,et al.  Efficient matching of pictorial structures , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[7]  David A. Forsyth,et al.  Finding and tracking people from the bottom up , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8]  Sidharth Bhatia,et al.  Tracking loose-limbed people , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[9]  Jitendra Malik,et al.  Tracking people with twists and exponential maps , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[10]  Kazuhide Sugawara Weighted Hough transform on a gridded image plane , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[11]  Rómer Rosales,et al.  Inferring body pose without tracking body parts , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[12]  Jake K. Aggarwal,et al.  Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[13]  Andrew Blake,et al.  Articulated body motion capture by annealed particle filtering , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[14]  Yang Song,et al.  Unsupervised Learning of Human Motion , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..