Evaluation of head pose estimation methods for a non-cooperative biometric system

Automatic head pose estimation is playing more and more important role in current computer vision applications. Precise localization of face landmark points together with subsequent analysis allow to determine person's gaze direction or its facial expression. Such solutions are widely used in driver assistance systems, intelligent environments like smart rooms, human-computer interfaces or identification systems. Surprisingly, in the literature, the tasks of determining face yaw or pitch angles and localizing landmark points are approached as separate problems. In this work, authors combine these two issues and analyze them together. Initially, a deep investigation of other researchers work is presented. On this basis, authors examine selected methods in the context of their application in non-cooperative recognition system. Experiments are conducted on a new, dedicated for this purpose multi-pose face dataset. Obtained results yield mean error of face yaw angle calculation of 3°.

[1]  Josephine Sullivan,et al.  One millisecond face alignment with an ensemble of regression trees , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Kamal Bijlani,et al.  Yaw Estimation Using Cylindrical and Ellipsoidal Face Models , 2014, IEEE Transactions on Intelligent Transportation Systems.

[4]  Stefanos Zafeiriou,et al.  300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[5]  Kang Ryoung Park,et al.  Real-Time Gaze Estimator Based on Driver's Head Orientation for Forward Collision Warning System , 2011, IEEE Transactions on Intelligent Transportation Systems.

[6]  Simon Baker Real-time non-rigid driver head tracking for driver mental state estimation , 2004 .

[7]  Trevor Darrell,et al.  Recognizing gaze aversion gestures in embodied conversational discourse , 2006, ICMI '06.

[8]  David Beymer,et al.  Face recognition under varying pose , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Damian Kacperski,et al.  Pose-oriented face images acquisition platform , 2016, 2016 MIXDES - 23rd International Conference Mixed Design of Integrated Circuits and Systems.

[10]  Zhencheng Hu,et al.  Robust facial landmark localization using multi partial features , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[11]  Ling Chen,et al.  Large head movement tracking using sift-based registration , 2007, ACM Multimedia.

[12]  Yukinori Yamada,et al.  Development of a New Pre-crash Safety System , 2006 .

[13]  Mohan M. Trivedi,et al.  Video arrays for real-time tracking of person, head, and face in an intelligent room , 2003, Machine Vision and Applications.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Alex Pentland,et al.  Towards Measuring Human Interactions in Conversational Settings , 2001 .

[16]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[17]  Mohan M. Trivedi,et al.  Multi-spectral and multi-perspective video arrays for driver body tracking and activity analysis , 2007, Comput. Vis. Image Underst..

[18]  Yun Fu,et al.  hMouse: Head Tracking Driven Virtual Computer Mouse , 2007, 2007 IEEE Workshop on Applications of Computer Vision (WACV '07).

[19]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.