An Ensemble Model For Human Posture Recognition

Human Body Pose Estimation (HBPE) and HumanBody Posture Recognition(HBPR) have improved significantly in the past decade. Gaining access to huge amounts of data, Kinect camera, neural networks and specifically deep convolutional neural networks (deep convnets) have led to fascinating success in these fields. In this paper we propose an ensemble model for human body posture recognition. Deep convnets are the main building block and fundamental aspect of our proposed model. We leverage deep convnets in two variations to classify postures. First, we use them for an end-to-end training scenario. We perform transfer learning with Imagenet weights on deep convnets with our gathered dataset of RGB images to classify five different postures. Second, we use a pre-trained deep convnet[1] (pose estimator) for estimating human body joints in RGB images. The pre-trained pose estimator has been trained to calculate a total of 17 2D joints coordinates and we utilize these coordinates to train a decision tree-based classifier for classification among five classes. Both variations are examined with different settings. The best settings for both variations are combined together to create our proposed model. More specifically, the classification layers of both variations are stacked together and fed to a logistic regression unit for a better classification result. Transfer learning, training and experiments in this paper are based on only RGB images from our gathered dataset and human body joints coordinates extracted from these images, which conveys that our proposed model does not require depth images or any sensor. Eventually, experimental results on the images show that the proposed model has higher performance than fundamental variations. Specifically, our model is able to correctly recognize the human posture in the majority of the images that one of the two fundamental variations fails to classify. The code for the proposed model and our gathered dataset are available on github1.

[1]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[2]  Stefan Wermter,et al.  A Multichannel Convolutional Neural Network for Hand Posture Recognition , 2014, ICANN.

[3]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[4]  Wolfram Burgard,et al.  3D Human Pose Estimation in RGBD Images for Robotic Task Learning , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[5]  Rafael Cabeza,et al.  Robust and accurate 2D-tracking-based 3D positioning method: Application to head pose estimation , 2019, Comput. Vis. Image Underst..

[6]  Cheng Han,et al.  Hybrid approach for human posture recognition using anthropometry and BP neural network based on Kinect V2 , 2019, EURASIP J. Image Video Process..

[7]  W. Marsden I and J , 2012 .

[8]  Wen-June Wang,et al.  Human Posture Recognition Based on Images Captured by the Kinect Sensor , 2016 .

[9]  Mohamed El Amine Elforaici,et al.  Posture Recognition Using an RGB-D Camera: Exploring 3D Body Modeling and Deep Learning Approaches , 2018, 2018 IEEE Life Sciences Conference (LSC).

[10]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[11]  Shusheng Bi,et al.  Human Posture Recognition Using Skeleton and Depth Information , 2018, 2018 WRC Symposium on Advanced Robotics and Automation (WRC SARA).

[12]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Zhifeng Liu,et al.  Human standing posture recognition based on CNN and pressure floor , 2020, J. Comput. Methods Sci. Eng..

[14]  Thi-Lan Le,et al.  Human posture recognition using human skeleton provided by Kinect , 2013, 2013 International Conference on Computing, Management and Telecommunications (ComManTel).

[15]  Ao Tang,et al.  A Real-Time Hand Posture Recognition System Using Deep Neural Networks , 2015, ACM Trans. Intell. Syst. Technol..

[16]  Hyun Myung,et al.  Real-Time Human Pose Estimation and Gesture Recognition from Depth Images Using Superpixels and SVM Classifier , 2015, Sensors.

[17]  Sahil Shah,et al.  Gesture Recognition Technology: A Review , 2012 .