Human Pose Recognition Based on Depth Image Multifeature Fusion

The recognition of human pose based on machine vision usually results in a low recognition rate, low robustness, and low operating efficiency. That is mainly caused by the complexity of the background, as well as the diversity of human pose, occlusion, and self-occlusion. To solve this problem, a feature extraction method combining directional gradient of depth feature (DGoD) and local difference of depth feature (LDoD) is proposed in this paper, which uses a novel strategy that incorporates eight neighborhood points around a pixel for mutual comparison to calculate the difference between the pixels. A new data set is then established to train the random forest classifier, and a random forest two-way voting mechanism is adopted to classify the pixels on different parts of the human body depth image. Finally, the gravity center of each part is calculated and a reasonable point is selected as the joint to extract human skeleton. The experimental results show that the robustness and accuracy are significantly improved, associated with a competitive operating efficiency by evaluating our approach with the proposed data set.

[1]  K Kurita,et al.  A New Motion Control Method for Bipedal Robot Based on Noncontact and Nonattached Human Motion Sensing Technique , 2011, IEEE Transactions on Industry Applications.

[2]  Andrew Blake,et al.  Efficient Human Pose Estimation from Single Depth Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  W. Marsden I and J , 2012 .

[4]  Tao Chen,et al.  Semantic segmentation of RGBD images based on deep depth regression , 2018, Pattern Recognit. Lett..

[5]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[6]  Sven J. Dickinson,et al.  Recognize Human Activities from Partially Observed Videos , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  K. Kurita,et al.  A new motion control method for bipedal robot based on non-contact and non-attached human motion sensing technique , 2009, 2009 International Conference on Electrical Machines and Systems.

[8]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Mathieu Aubry,et al.  Convolutional Neural Networks for joint object detection and pose estimation: A comparative study , 2014, ArXiv.

[10]  Silvio Savarese,et al.  Articulated part-based model for joint object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[11]  Timothy F. Cootes,et al.  Fully Automatic Segmentation of the Proximal Femur Using Random Forest Regression Voting , 2013, IEEE Transactions on Medical Imaging.

[12]  Wolfram Burgard,et al.  Deep learning for human part discovery in images , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Shu Liao,et al.  Multi-Instance Deep Learning: Discover Discriminative Local Anatomies for Bodypart Recognition , 2016, IEEE Transactions on Medical Imaging.

[14]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Hyun Myung,et al.  Real-Time Human Pose Estimation and Gesture Recognition from Depth Images Using Superpixels and SVM Classifier , 2015, Sensors.

[16]  Sankar K. Pal,et al.  A review on image segmentation techniques , 1993, Pattern Recognit..

[17]  Maode Ma,et al.  Intelligent Image Recognition System for Marine Fouling Using Softmax Transfer Learning and Deep Convolutional Neural Networks , 2017, Complex..

[18]  Yi Yang,et al.  Articulated Human Detection with Flexible Mixtures of Parts , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Antonis A. Argyros,et al.  Tracking the articulated motion of the human body with two RGBD cameras , 2014, Machine Vision and Applications.

[20]  Cordelia Schmid,et al.  Expanded Parts Model for Semantic Description of Humans in Still Images , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Je-Won Kang,et al.  Combining random forest with multi-block local binary pattern feature selection for multiclass head pose estimation , 2017, PloS one.

[22]  Zhaoyang Lu,et al.  Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition , 2018, Complex..

[23]  Jon Atli Benediktsson,et al.  Hyperspectral Image Classification With Rotation Random Forest Via KPCA , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[24]  James J. Little,et al.  A Simple Yet Effective Baseline for 3d Human Pose Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[25]  Andrew Hynes,et al.  Human Part Segmentation in Depth Images with Annotated Part Positions , 2018, Sensors.

[26]  Claudia Lindner,et al.  Robust and Accurate Shape Model Matching Using Random Forest Regression-Voting. , 2015, IEEE transactions on pattern analysis and machine intelligence.

[27]  Andrea Sanna,et al.  Semantics-Based Intelligent Human-Computer Interaction , 2016, IEEE Intelligent Systems.

[28]  Yi Yang,et al.  Depth-Based Hand Pose Estimation: Methods, Data, and Challenges , 2015, International Journal of Computer Vision.

[29]  Sergio Escalera,et al.  Graph cuts optimization for multi-limb human segmentation in depth maps , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Gustavo Carneiro,et al.  Formulating semantic image annotation as a supervised learning problem , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[31]  Francesc Moreno-Noguer,et al.  3D Human Pose Estimation from a Single Image via Distance Matrix Regression , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Ana-Maria Cretu,et al.  Static and Dynamic Hand Gesture Recognition in Depth Data Using Dynamic Time Warping , 2016, IEEE Transactions on Instrumentation and Measurement.

[33]  Xi Chen,et al.  Precision Security: Integrating Video Surveillance with Surrounding Environment Changes , 2018, Complex..

[34]  Leonidas J. Guibas,et al.  Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[35]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Marwan Torki,et al.  Histogram of Oriented Displacements (HOD): Describing Trajectories of Human Joints for Action Recognition , 2013, IJCAI.

[37]  Wei-Yun Yau,et al.  Improving human body part detection using deep learning and motion consistency , 2016, 2016 14th International Conference on Control, Automation, Robotics and Vision (ICARCV).

[38]  K. Nishi,et al.  Generation of human depth images with body part labels for complex human pose recognition , 2017, Pattern Recognit..

[39]  Christoph Rasche,et al.  Rapid contour detection for image classification , 2017, IET Image Process..

[40]  Bernt Schiele,et al.  Articulated people detection and pose estimation: Reshaping the future , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Zhenjun Tang,et al.  Combining Generative/Discriminative Learning for Automatic Image Annotation and Retrieval , 2012 .

[42]  Chi Xu,et al.  Mouse Pose Estimation From Depth Images , 2015, ArXiv.

[43]  Xian Sun,et al.  Multi-view semi-supervised learning for image classification , 2016, Neurocomputing.

[44]  Jakub Konecný,et al.  One-shot-learning gesture recognition using HOG-HOF features , 2014, J. Mach. Learn. Res..

[45]  Gérard G. Medioni,et al.  Human pose estimation from a single view point, real-time range sensor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[46]  Prabhas Chongstitvatana,et al.  Application of structured support vector machine backpropagation to a convolutional neural network for human pose estimation , 2017, Neural Networks.

[47]  Qingquan Li,et al.  Improving Land Use/Land Cover Classification by Integrating Pixel Unmixing and Decision Tree Methods , 2017, Remote. Sens..