Human Body 3D Posture Estimation Using Significant Points and Two Cameras

This paper proposes a three-dimensional (3D) human posture estimation system that locates 3D significant body points based on 2D body contours extracted from two cameras without using any depth sensors. The 3D significant body points that are located by this system include the head, the center of the body, the tips of the feet, the tips of the hands, the elbows, and the knees. First, a linear support vector machine- (SVM-) based segmentation method is proposed to distinguish the human body from the background in red, green, and blue (RGB) color space. The SVM-based segmentation method uses not only normalized color differences but also included angle between pixels in the current frame and the background in order to reduce shadow influence. After segmentation, 2D significant points in each of the two extracted images are located. A significant point volume matching (SPVM) method is then proposed to reconstruct the 3D significant body point locations by using 2D posture estimation results. Experimental results show that the proposed SVM-based segmentation method shows better performance than other gray level- and RGB-based segmentation approaches. This paper also shows the effectiveness of the 3D posture estimation results in different postures.

[1]  Yili Liu,et al.  Using the Support Vector Regression Approach to Model Human Performance , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[2]  Rita Cucchiara,et al.  Probabilistic posture classification for Human-behavior analysis , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[3]  Jun-Wei Hsieh,et al.  Segmentation of Human Body Parts Using Deformable Triangulation , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[4]  Guanrong Chen,et al.  Kalman Filtering with Real-time Applications , 1987 .

[5]  Xiaojun Wu,et al.  Real-time dynamic 3-D object shape reconstruction and high-fidelity texture mapping for 3-D video , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Takeo Kanade,et al.  A real time system for robust 3D voxel reconstruction of human motions , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[7]  Chia-Feng Juang,et al.  A TS Fuzzy System Learned Through a Support Vector Machine in Principal Component Space for Real-Time Object Detection , 2012, IEEE Transactions on Industrial Electronics.

[8]  Chia-Feng Juang,et al.  A Self-Organizing TS-Type Fuzzy Network With Support Vector Learning and its Application to Classification Problems , 2007, IEEE Transactions on Fuzzy Systems.

[9]  Salvatore Sessa,et al.  Towards Miniaturization of a MEMS-Based Wearable Motion Capture System , 2011, IEEE Transactions on Industrial Electronics.

[10]  Liang-Gee Chen,et al.  Efficient moving object segmentation algorithm using background registration technique , 2002, IEEE Trans. Circuits Syst. Video Technol..

[11]  Michael Potmesil Generating octree models of 3D objects from their silhouettes in a sequence of images , 1987, Comput. Vis. Graph. Image Process..

[12]  Yillbyung Lee,et al.  The Relational Properties among Results of Background Subtraction , 2008, 2008 10th International Conference on Advanced Communication Technology.

[13]  Luc Van Gool,et al.  Drift detection and removal for sequential structure from motion algorithms , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Chia-Feng Juang,et al.  Object Detection Using Color Entropies and a Fuzzy Classifier , 2013, IEEE Computational Intelligence Magazine.

[15]  Jonathan H. Connell,et al.  A Statistical Approach for Real-time Robust Background Subtrac tion and Shadow Detection , 2014 .

[16]  Quming Zhou,et al.  Tracking and Classifying Moving Objects from Video , 2001 .

[17]  M.T. Razali,et al.  Detection and Classification of Moving Object for Smart Vision Sensor , 2006, 2006 2nd International Conference on Information & Communication Technologies.

[18]  Kazuhiko Takahashi,et al.  Remarks on 3D Human Posture Estimation System Using Simple Multi-Camera System , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[19]  Kajiro Watanabe,et al.  Kinematical analysis and measurement of sports form , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[20]  Chia-Feng Juang,et al.  Speedup of Implementing Fuzzy Neural Networks With High-Dimensional Inputs Through Parallel Processing on Graphic Processing Units , 2011, IEEE Transactions on Fuzzy Systems.

[21]  James M. Keller,et al.  Modeling Human Activity From Voxel Person Using Fuzzy Logic , 2009, IEEE Transactions on Fuzzy Systems.

[22]  Csaba Benedek,et al.  Study on color space selection for detecting cast shadows in video surveillance , 2007, Int. J. Imaging Syst. Technol..

[23]  Chia-Feng Juang,et al.  Face localization using fuzzy classifier with wavelet-localized focus color features and shape features , 2012, Digit. Signal Process..

[24]  Rómer Rosales,et al.  Estimating 3D body pose using uncalibrated cameras , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[25]  A. Murat Tekalp,et al.  Audio-driven human body motion analysis and synthesis , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Takeshi Ohashi,et al.  Motion generator approach to translating human motion from video to animation , 1999, Proceedings. Seventh Pacific Conference on Computer Graphics and Applications (Cat. No.PR00293).

[27]  Jun Ohya,et al.  Human figure synthesis and animation for virtual space teleconferencing , 1995, Proceedings Virtual Reality Annual International Symposium '95.

[28]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[29]  Jake K. Aggarwal,et al.  TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 2008 .

[30]  D. Lee,et al.  Computer Vision-Based Human Body Segmentation and Posture Estimation , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[31]  Chia-Feng Juang,et al.  A Self-Evolving Interval Type-2 Fuzzy Neural Network With Online Structure and Parameter Learning , 2008, IEEE Transactions on Fuzzy Systems.

[32]  Ehud Rivlin,et al.  Understanding Video Events: A Survey of Methods for Automatic Interpretation of Semantic Occurrences in Video , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Philip Birch,et al.  Illumination invariant stationary object detection , 2013, IET Comput. Vis..

[34]  Mohan M. Trivedi,et al.  Articulated body posture estimation from multi-camera voxel data , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[35]  Susan Hackwood,et al.  Computational geometric methods in volumetric intersection for 3D reconstruction , 1989, Proceedings, 1989 International Conference on Robotics and Automation.

[36]  Chia-Feng Juang,et al.  Fuzzy System Learned Through Fuzzy Clustering and Support Vector Machine for Human Skin Color Segmentation , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[37]  E. R. Davies,et al.  Machine vision - theory, algorithms, practicalities , 2004 .

[38]  Anup Basu,et al.  Human Activity Recognition Based on Silhouette Directionality , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Chia-Feng Juang,et al.  Human Body Posture Classification by a Neural Fuzzy Network and Home Care System Application , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.