Paper Information - Human Pose Recognition Based on Depth Image Multifeature Fusion

Human Pose Recognition Based on Depth Image Multifeature Fusion

Human pose recognition based on machine vision often suffers from a low recognition rate, low robustness, and low operating efficiency. These problems are mainly caused by the complexity of the background, the diversity of human poses, occlusion, and self-occlusion. To address them, this paper proposes a feature extraction method that combines the directional gradient of depth feature (DGoD) with the local difference of depth feature (LDoD). The method uses a novel strategy that compares the eight neighborhood points around a pixel with one another to calculate the differences between pixels. A new data set is then established to train a random forest classifier, and a random forest two-way voting mechanism is adopted to classify the pixels of a human body depth image into body parts. Finally, the gravity center of each part is calculated and a reasonable point is selected as the joint in order to extract the human skeleton. Experimental results on the proposed data set show that robustness and accuracy are significantly improved while operating efficiency remains competitive.
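The abstract does not give the exact definitions of DGoD or LDoD, so the following is only a minimal sketch of two of the ideas it mentions: a per-pixel feature built from depth differences to the eight neighborhood points, and the gravity center of each labeled body part. The function names, the `step` parameter, the edge-replication padding, and the use of label 0 as background are all assumptions made for illustration. The random forest classifier and the two-way voting mechanism are not reproduced here.

```python
import numpy as np

# The eight neighbours of a pixel as (row, column) offsets.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def eight_neighbor_differences(depth, step=1):
    """Return an (H, W, 8) array of depth differences to the eight neighbours.

    Hypothetical illustration: `step` (neighbour distance in pixels) and the
    edge-replication padding are assumptions, not taken from the paper.
    """
    depth = np.asarray(depth, dtype=np.float64)
    h, w = depth.shape
    padded = np.pad(depth, step, mode="edge")
    feats = np.empty((h, w, len(OFFSETS)))
    for k, (dy, dx) in enumerate(OFFSETS):
        # Top-left corner of the shifted window inside the padded image.
        r0 = step + dy * step
        c0 = step + dx * step
        feats[:, :, k] = padded[r0:r0 + h, c0:c0 + w] - depth
    return feats


def part_centroids(labels, background=0):
    """Return {part label: (row, col) gravity center} for a label image.

    Assumes `labels` holds one integer body-part label per pixel and that
    `background` marks pixels that belong to no part.
    """
    labels = np.asarray(labels)
    centroids = {}
    for part in np.unique(labels):
        if part == background:
            continue
        rows, cols = np.nonzero(labels == part)
        centroids[int(part)] = (float(rows.mean()), float(cols.mean()))
    return centroids
```

In a pipeline like the one the abstract outlines, the feature array would be flattened to one row per pixel and passed to a classifier, and the resulting label image would be passed to `part_centroids` to obtain candidate joint positions.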
