Slice representation of range data for head pose estimation

Visual estimation of head pose is desirable for computer vision applications such as face recognition, human computer interaction, and affective computing. However, accurate estimation of head pose in uncontrolled environment is still a grand challenge. This paper proposes a novel feature representation model for accurate pose estimation. In this model, a range image is divided into a set of simple slices that contain abundant geometric cues can be used to accurately describe the poses of a subject. This model provides a general framework for designing new features for head pose estimation. According to this model, design of a new feature model for describing a slice, then a new set of features is generated by combining all slices for describing range images. Due to the huge number of slices that can be generated from single range image, even a simple description model of slice can achieve robust performance. With the guide of this model, two novel range image representation models, which are Local Slice Depth (LSD) and Local Slice Orientation (LSO), are designed. LSD can be used for coarse estimation of head poses, while LSO can achieve accurate results. Moreover, in order to evaluate the performance of proposed representation model, an automatic head pose estimation method is implemented using a Kinect sensor. Firstly both color and range images captured by a Kinect sensor are used to localize and segment the facial region from background. Secondly, two novel integral images, namely slice depth integral image and slice coordinates integral image, are proposed to achieve real-time feature extraction. Finally, random forests are used to learn a stable relationship between slice feature descriptors and head pose parameters. Experiments on both low-quality depth data set Biwi and high-quality depth data set ETH demonstrate state-of-the-art performance of our method. (C) 2014 Elsevier Inc. All rights reserved.

[1]  Tieniu Tan,et al.  Face pose estimation based on integral slice features of single depth images , 2011, The First Asian Conference on Pattern Recognition.

[2]  Luc Van Gool,et al.  Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[3]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[4]  Nicu Sebe,et al.  Combining Head Pose and Eye Location Information for Gaze Estimation , 2012, IEEE Transactions on Image Processing.

[5]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[6]  Mohan M. Trivedi,et al.  Head Pose Estimation and Augmented Reality Tracking: An Integrated System and Evaluation for Monitoring Driver Awareness , 2010, IEEE Transactions on Intelligent Transportation Systems.

[7]  Luc Van Gool,et al.  Fast 3D Scanning with Automatic Motion Compensation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Tieniu Tan,et al.  Real-Time Head Pose Estimation Using Random Regression Forests , 2011, CCBR.

[10]  Horst Bischof,et al.  Supervised local subspace learning for continuous head pose estimation , 2011, CVPR 2011.

[11]  Luc Van Gool,et al.  Real-time face pose estimation from single range images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Richard Bowden,et al.  Generalised Pose Estimation Using Depth , 2010, ECCV Workshops.

[13]  Chiraz BenAbdelkader Robust Head Pose Estimation Using Supervised Manifold Learning , 2010, ECCV.

[14]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Chi Fang,et al.  Head Pose Estimation Based on Random Forests for Multiclass Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[16]  Rama Chellappa,et al.  Automatic head pose estimation using randomly projected dense SIFT descriptors , 2012, 2012 19th IEEE International Conference on Image Processing.

[17]  Antonis A. Argyros,et al.  Head pose estimation on depth data based on Particle Swarm Optimization , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Alan F. Smeaton,et al.  Detector adaptation by maximising agreement between independent data sources , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Haibo Li,et al.  3D head pose estimation using the Kinect , 2011, 2011 International Conference on Wireless Communications and Signal Processing (WCSP).

[20]  Larry S. Davis,et al.  On partial least squares in head pose estimation: How to simultaneously deal with misalignment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Daqiang Zhang,et al.  Head Pose Estimation Based on Manifold Embedding and Distance Metric Learning , 2009, ACCV.

[22]  Vijayan K. Asari,et al.  A Two-Layer Framework for Piecewise Linear Manifold-Based Head Pose Estimation , 2012, International Journal of Computer Vision.

[23]  Luc Van Gool,et al.  Real Time Head Pose Estimation from Consumer Depth Cameras , 2011, DAGM-Symposium.

[24]  Jiwen Lu,et al.  Ordinary Preserving Manifold Analysis for Human Age and Head Pose Estimation , 2013, IEEE Transactions on Human-Machine Systems.

[25]  Bingpeng Ma,et al.  A novel feature descriptor based on biologically inspired feature for head pose estimation , 2013, Neurocomputing.

[26]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.