A novel 2D/3D database with automatic face annotation for head tracking and pose estimation

We present a new public database of videos for head tracking and pose estimation.We provide ground-truth data with lower noise than other similar frameworks.We have developed an automatic face annotation procedure with negligible error.We show the utility of the database for training and evaluation of algorithms.We carry out a thorough comparison between state-of-the-art head tracking methods. A new public database of videos for head tracking and pose estimation is presented in this paper with the goal of establishing a new framework for algorithm validation, replacing out of date frameworks. Position data has been recorded with a magnetic sensor-transmitter that has previously been aligned and synchronized with a commercial webcam, and we provide reliable ground-truth for 3D rotation and translation of the head with respect to the camera. In addition to this, an automatic face annotation procedure has been developed, which provides the image position of 54 facial landmarks, with negligible error, in every video frame in the database. This image ground-truth can be used for algorithm training or head tracking evaluation, among others. In order to show the usability of the database, we evaluate three head tracking approaches and three head models, and combine them to provide nine different head pose estimation sets of results. We show the validity of the presented database both for training and evaluation of head tracking and pose estimation methods, and provide an interesting comparison in performance of state-of-the-art algorithms. These results may also serve as reference to encourage other researchers to train and test their algorithms with this database, and compare their results with the ones presented in this paper.

[1]  Subramanian Ramanathan,et al.  Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments , 2014, 2014 22nd International Conference on Pattern Recognition.

[2]  B. H. Pawan Prasad,et al.  A robust head pose estimation system for uncalibrated monocular videos , 2010, ICVGIP '10.

[3]  Daijin Kim,et al.  Robust head tracking using 3D ellipsoidal head model in particle filter , 2008, Pattern Recognit..

[4]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Myung Jin Chung,et al.  A novel non-intrusive eye gaze estimation using cross-ratio under large head motion , 2005, Comput. Vis. Image Underst..

[6]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Subramanian Ramanathan,et al.  No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion , 2013, 2013 IEEE International Conference on Computer Vision.

[8]  Juan J. Cerrolaza,et al.  Hierarchical Statistical Shape Models of Multiobject Anatomical Structures: Application to Brain MRI , 2012, IEEE Transactions on Medical Imaging.

[9]  Jing Xiao,et al.  Robust full-motion recovery of head by dynamic templates and re-registration techniques , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[10]  Qiang Ji,et al.  In the Eye of the Beholder: A Survey of Models for Eyes and Gaze , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[12]  Rama Chellappa,et al.  Growing Regression Forests by Classification: Applications to Object Pose Estimation , 2013, ECCV.

[13]  Marco La Cascia,et al.  Fast, Reliable Head Tracking under Varying Illumination: An Approach Based on Registration of Texture-Mapped 3D Models , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Zhiwei Zhu,et al.  Novel Eye Gaze Tracking Techniques Under Natural Head Movement , 2007, IEEE Transactions on Biomedical Engineering.

[15]  Sami Romdhani,et al.  A 3D Face Model for Pose and Illumination Invariant Face Recognition , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[16]  Jian-Gang Wang,et al.  EM enhancement of 3D head pose estimated by point at infinity , 2007, Image Vis. Comput..

[17]  Nicu Sebe,et al.  Combining Head Pose and Eye Location Information for Gaze Estimation , 2012, IEEE Transactions on Image Processing.

[18]  Ling Chen,et al.  Large head movement tracking using sift-based registration , 2007, ACM Multimedia.

[19]  Whoi-Yul Kim,et al.  Long-Range Gaze Tracking System for Large Movements , 2013, IEEE Transactions on Biomedical Engineering.

[20]  Murphy-ChutorianErik,et al.  Head Pose Estimation in Computer Vision , 2009 .

[21]  Thomas Vetter,et al.  On compositional Image Alignment, with an application to Active Appearance Models , 2009, CVPR.

[22]  Dario Cazzato,et al.  An Investigation on the Feasibility of Uncalibrated and Unconstrained Gaze Tracking for Human Assistive Applications by Using Head Pose Estimation , 2014, Sensors.

[23]  Shaogang Gong,et al.  Face distributions in similarity space under varying head pose , 2001, Image Vis. Comput..

[24]  Larry S. Davis,et al.  Model-based object pose in 25 lines of code , 1992, International Journal of Computer Vision.

[25]  Takeo Kanade,et al.  Robust 3D Head Tracking by Online Feature Registration , 2008 .

[26]  Frans C. T. van der Helm,et al.  Accurate Gaze Direction Measurements With Free Head Movement for Strabismus Angle Estimation , 2013, IEEE Transactions on Biomedical Engineering.

[27]  Thomas Vetter,et al.  Face Recognition Based on Fitting a 3D Morphable Model , 2003, IEEE Trans. Pattern Anal. Mach. Intell..