Human Body Orientation Estimation using Convolutional Neural Network

Personal robots are expected to interact with the user by recognizing the user's face. However, in most of the service robot applications, the user needs to move himself/herself to allow the robot to see him/her face to face. To overcome such limitations, a method for estimating human body orientation is required. Previous studies used various components such as feature extractors and classification models to classify the orientation which resulted in low performance. For a more robust and accurate approach, we propose the light weight convolutional neural networks, an end to end system, for estimating human body orientation. Our body orientation estimation model achieved 81.58% and 94% accuracy with the benchmark dataset and our own dataset respectively. The proposed method can be used in a wide range of service robot applications which depend on the ability to estimate human body orientation. To show its usefulness in service robot applications, we designed a simple robot application which allows the robot to move towards the user's frontal plane. With this, we demonstrated an improved face detection rate.

[1]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[2]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[3]  Christian Vollmer,et al.  Estimation of human upper body orientation for mobile robotics using an SVM decision tree on monocular images , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Ivan Lee,et al.  Mobile robotic active view planning for physiotherapy and physical exercise guidance , 2015, 2015 IEEE 7th International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics, Automation and Mechatronics (RAM).

[5]  Angelo M. Sabatini,et al.  Estimating Three-Dimensional Orientation of Human Body Parts by Inertial/Magnetic Sensing , 2011, Sensors.

[6]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Alexandre Heili,et al.  A joint estimation of head and body orientation cues in surveillance video , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[8]  Bruno Steux,et al.  tinySLAM: A SLAM algorithm in less than 200 lines C-language program , 2010, 2010 11th International Conference on Control Automation Robotics & Vision.

[9]  In-So Kweon,et al.  Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network , 2014, ACCV.

[10]  Songhwai Oh,et al.  Chance-constrained target tracking for mobile robots , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Sheng Tang,et al.  Accurate Estimation of Human Body Orientation From RGB-D Sensors , 2013, IEEE Transactions on Cybernetics.

[13]  Cristian Sminchisescu,et al.  Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Gang Qian,et al.  Binocular dance pose recognition and body orientation estimation via multilinear analysis , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Cheng-Kok Koh,et al.  A 3D-point-cloud feature for human-pose estimation , 2013, 2013 IEEE International Conference on Robotics and Automation.