Multi-Person Pose Estimation Using Group-Based Convolutional Neural Network Model

Human pose estimation has drawn extensive attention recently and there has been significant progress on it due to the rising popularity of convolutional neural networks (CNN). However, existing state-of-the-art approaches suffer from occlusion, complicated backgrounds, and substantial position fluctuations because of disregarding the human body form. Human parsing is a very pertinent activity that can provide crucial semantic data about bodily parts for position estimation. To overcome the aforesaid limitations, this paper introduces a human pose estimation method using a group-based convolutional neural network model. The proposed method adopts a bottom-up parsing strategy that yields features to extract skeletal key points in the human body. Moreover, it creates a grouping of anatomical key points for individuals by utilizing the non-parametric description for the key point association vector field. Experimental results indicate that the proposed method provides superior performance than the state-of-the-art algorithms in terms of accuracy. In addition, it optimizes its output and detects occluded as well as invisible key points by incorporating feature representation. The proposed method surpasses the recent methods, achieving 93% of the mean average accuracy.

[1]  G. Amer,et al.  Robust Speech Emotion Recognition Using CNN+LSTM Based on Stochastic Fractal Search Optimization Algorithm , 2022, IEEE Access.

[2]  Licheng Jiao,et al.  A Lightweight Top-Down Multi-Person Pose Estimation Method Based on Symmetric Transformation and Global Matching , 2022, IEEE Access.

[3]  J. Lerga,et al.  Detection of Non-Stationary GW Signals in High Noise From Cohen’s Class of Time–Frequency Representations Using Deep Learning , 2022, IEEE Access.

[4]  M. De Marsico,et al.  Inflated 3D ConvNet context analysis for violence detection , 2021, Machine Vision and Applications.

[5]  Sangyoun Lee,et al.  An Efficient Approach Using Knowledge Distillation Methods to Stabilize Performance in a Lightweight Top-Down Posture Estimation Network , 2021, Sensors.

[6]  Pooja Kherwa,et al.  Articulated Human Pose Estimation Using Greedy Approach , 2021, Artificial Intelligence.

[7]  A. Garbett,et al.  Towards Understanding People’s Experiences of AI Computer Vision Fitness Instructor Apps , 2021, Conference on Designing Interactive Systems.

[8]  Wataru Takano,et al.  Graph Stacked Hourglass Networks for 3D Human Pose Estimation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Nojun Kwak,et al.  Exploring Rare Pose in Human Pose Estimation , 2020, IEEE Access.

[11]  Xiaoping Chen,et al.  Multi-Person Pose Estimation Under Complex Environment Based on Progressive Rotation Correction and Multi-Scale Feature Fusion , 2020, IEEE Access.

[12]  Jian Yang,et al.  Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[14]  Jinye Peng,et al.  Complex Human Pose Estimation via Keypoints Association Constraint Network , 2020, IEEE Access.

[15]  Chao-Kai Wen,et al.  Multi-Person Pose Estimation Using Thermal Images , 2020, IEEE Access.

[16]  Seong-Gyun Jeong,et al.  Anchor Loss: Modulating Loss Scale Based on Prediction Difficulty , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Xingyi Zhou,et al.  Objects as Points , 2019, ArXiv.

[18]  Dong Liu,et al.  Deep High-Resolution Representation Learning for Human Pose Estimation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Ying Wu,et al.  Deeply Learned Compositional Models for Human Pose Estimation , 2018, ECCV.

[20]  Shuicheng Yan,et al.  Human Pose Estimation with Parsing Induced Learner , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Fei Yang,et al.  Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Honggang Qi,et al.  Multi-Scale Structure-Aware Network for Human Pose Estimation , 2018, ECCV.

[23]  Gang Yu,et al.  Cascaded Pyramid Network for Multi-person Pose Estimation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Zhi Zhang,et al.  Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation , 2017, IEEE Transactions on Multimedia.

[25]  Xiaogang Wang,et al.  Learning Feature Pyramids for Human Pose Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26]  Xiu-Shen Wei,et al.  Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[27]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Andrew Zisserman,et al.  Recurrent Human Pose Estimation , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[29]  Georgios Tzimiropoulos,et al.  Human Pose Estimation via Convolutional Part Heatmap Regression , 2016, ECCV.

[30]  Bernt Schiele,et al.  DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model , 2016, ECCV.

[31]  Navdeep Jaitly,et al.  Chained Predictions Using Convolutional Neural Networks , 2016, ECCV.

[32]  Shimon Ullman,et al.  Human Pose Estimation Using Deep Consensus Voting , 2016, ECCV.

[33]  Varun Ramakrishna,et al.  Convolutional Pose Machines , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Peter V. Gehler,et al.  DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Jitendra Malik,et al.  Human Pose Estimation with Iterative Error Feedback , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Peiyun Hu,et al.  Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Ilya Kostrikov,et al.  An Efficient Convolutional Network for Human Pose Estimation , 2016, BMVC.

[38]  Jonathan Tompson,et al.  Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[39]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Christian Szegedy,et al.  DeepPose: Human Pose Estimation via Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.