Human pose modelling and body tracking from monocular video sequences

This paper proposes a computer vision-based approach to automatically detect human body parts and estimate the human body poses efficiently from a monocular video sequence. Human body parts detection is performed using colour, contours and silhouettes cues. It determines the 2D spatial locations of joints of a human body without any special markers on the body. The input image is segmented using silhouette extraction function to obtain a silhouette human figure. A parametric skin distribution modelling method is then utilized to detect the face and limbs of a person. Hue-Saturation-Value (HSV) colour space is chosen for this application. Radon transform is used to get more accurate orientation of the upper arms when they are inside the body perimeter. Various physical and motion constraints regarding the human body is then used to construct the upper body configuration. Our algorithm can estimate poses for the person wearing short sleeve and long sleeve shirt. It could estimate the human poses even under illumination changes, self-occlusion occurrence, and distance variations. Then, seventeen body poses are considered for classification. Eight features are extracted from each pose. Feed-forward neural network is used to classify the defined human body poses.

[1]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[2]  Yang Song,et al.  Unsupervised Learning of Human Motion , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Cristian Sminchisescu,et al.  Covariance scaled sampling for monocular 3D body tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Mubarak Shah,et al.  Person-on-person violence detection in video data , 2002, Object recognition supported by user interaction for service robots.

[5]  David A. Forsyth,et al.  Probabilistic Methods for Finding People , 2001, International Journal of Computer Vision.

[6]  Javier Varona,et al.  Automatic Human Body Modeling for Vision-Based Motion Capture , 2012 .

[7]  Satoshi Nakagawa,et al.  Automated detection of human for visual surveillance system , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[8]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[9]  Dirk Heylen,et al.  Towards real-time Body Pose Estimation for Presenters in Meeting Environments , 2005, WSCG.

[10]  Hironobu Fujiyoshi,et al.  Moving target classification and tracking from real-time video , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[11]  Mun Wai Lee,et al.  Proposal maps driven MCMC for estimating human body pose in static images , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..