Deep Fusion for 3D Gaze Estimation From Natural Face Images Using Multi-Stream CNNs

Over the last few decades, eye gaze estimation techniques have been thoroughly investigated by many researchers. However, predicting a 3D gaze from a 2D natural image remains challenging because it has to deal with several issues such as diverse head positions, face shape transformation, illumination variations, and subject individuality. Many previous studies employ convolutional neural networks (CNNs) for this task, and yet the accuracy needs improvement for its practical use. In this paper, we propose a 3D gaze estimation framework based on the data science perspective: First, a novel neural network architecture is designed to exploit every possible visual attribute such as the states of both eyes and the head position, including several augmentations; secondly, the data fusion method is utilized by incorporating multiple gaze datasets. Extensive experiments were carried out using two standard eye gaze datasets, including comparative analysis. The experimental results suggest that our method outperforms state-of-the-art with 2.8 degrees for MPIIGaze and 3.05 degrees for EYEDIAP dataset, respectively, indicating that it has a potential for real applications.

[1]  Yanxia Zhang,et al.  SideWays: a gaze interface for spontaneous interaction with situated displays , 2013, CHI.

[2]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[3]  R. Pieters,et al.  Visual Attention to Repeated Print Advertising: A Test of Scanpath Theory , 1999 .

[4]  Yoichi Sato,et al.  Appearance-Based Gaze Estimation Using Visual Saliency , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  S. Liversedge,et al.  Saccadic eye movements and cognition , 2000, Trends in Cognitive Sciences.

[6]  Qiang Ji,et al.  3D gaze estimation with a single camera without IR illumination , 2008, 2008 19th International Conference on Pattern Recognition.

[7]  Yanxia Zhang,et al.  Pupil-canthi-ratio: a calibration-free method for tracking horizontal gaze direction , 2014, AVI.

[8]  Arnaldo J. Abrantes,et al.  Eye Gaze as a Human-computer Interface , 2014 .

[9]  Juan J. Cerrolaza,et al.  Evaluation of accurate eye corner detection methods for gaze estimation , 2014 .

[10]  Wojciech Matusik,et al.  Eye Tracking for Everyone , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Peter Corcoran,et al.  Convolutional Neural Network Implementation for Eye-Gaze Estimation on Low-Quality Consumer Imaging Systems , 2019, IEEE Transactions on Consumer Electronics.

[12]  Sergio Escalera,et al.  Recurrent CNN for 3D Gaze Estimation using Appearance and Shape Cues , 2018, BMVC.

[13]  Mario Fritz,et al.  It’s Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[14]  Carlos Hitoshi Morimoto,et al.  Detecting eye position and gaze from a single camera and 2 light sources , 2002, Object recognition supported by user interaction for service robots.

[15]  Yiannis Demiris,et al.  RT-GENE: Real-Time Eye Gaze Estimation in Natural Environments , 2018, ECCV.

[16]  Mario Fritz,et al.  MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Jean-Marc Odobez,et al.  EYEDIAP: a database for the development and evaluation of gaze estimation algorithms from RGB and RGB-D cameras , 2014, ETRA.

[18]  Feng Duan,et al.  Training a Camera to Perform Long-Distance Eye Tracking by Another Eye-Tracker , 2019, IEEE Access.

[19]  Yunhui Liu,et al.  Appearance-Based Gaze Estimator for Natural Interaction Control of Surgical Robots , 2019, IEEE Access.

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Feng Lu,et al.  Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric Regression , 2018, ECCV.

[22]  Rik Pieters,et al.  Attention Capture and Transfer in Advertising: Brand, Pictorial, and Text-Size Effects , 2004 .

[23]  R. Pieters,et al.  A Review of Eye-Tracking Research in Marketing , 2008 .

[24]  Shenghua Gao,et al.  Multiview Multitask Gaze Estimation With Deep Convolutional Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[25]  R. Pieters,et al.  Visual attention during brand choice : The impact of time pressure and task motivation , 1999 .

[26]  Yusuke Sugano,et al.  Revisiting data normalization for appearance-based gaze estimation , 2018, ETRA.

[27]  Otmar Hilliges,et al.  Deep Pictorial Gaze Estimation , 2018, ECCV.

[28]  Shenghua Gao,et al.  RGBD Based Gaze Estimation via Multi-Task CNN , 2019, AAAI.

[29]  Otmar Hilliges,et al.  Learning to find eye region landmarks for remote gaze estimation in unconstrained settings , 2018, ETRA.

[30]  Michel Wedel,et al.  Eye tracking for visual marketing , 2008 .

[31]  Dan Witzner Hansen,et al.  Eye tracking in the wild , 2005, Comput. Vis. Image Underst..

[32]  Narendra Ahuja,et al.  Appearance-based eye gaze estimation , 2002, Sixth IEEE Workshop on Applications of Computer Vision, 2002. (WACV 2002). Proceedings..

[33]  Jiaqi Jiang,et al.  Learning A 3D Gaze Estimator with Improved Itracker Combined with Bidirectional LSTM , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[34]  Abdulmotaleb El-Saddik,et al.  A Novel Eye-Gaze-Controlled Wheelchair System for Navigating Unknown Environments: Case Study With a Person With ALS , 2016, IEEE Access.

[35]  Mario Fritz,et al.  Appearance-based gaze estimation in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Alan Kennedy,et al.  Book Review: Eye Tracking: A Comprehensive Guide to Methods and Measures , 2016, Quarterly journal of experimental psychology.

[37]  Qiong Huang,et al.  TabletGaze: Unconstrained Appearance-based Gaze Estimation in Mobile Tablets , 2015 .

[38]  Stephen Chi-fai Chan,et al.  Building a Self-Learning Eye Gaze Model from User Interaction Data , 2014, ACM Multimedia.

[39]  Zhiwei Zhu,et al.  Nonlinear Eye Gaze Mapping Function Estimation via Support Vector Regression , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[40]  J Merchant,et al.  Remote measurement of eye direction allowing subject motion over one cubic foot of space. , 1974, IEEE transactions on bio-medical engineering.

[41]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[42]  Andrew Blake,et al.  Sparse and Semi-supervised Visual Mapping with the S^3GP , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[43]  Radu Horaud,et al.  A Comprehensive Analysis of Deep Regression , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Iain Matthews,et al.  Passive Driver Gaze Tracking with Active Appearance Models (特集 センシング技術) , 2004 .

[45]  B. Rogé,et al.  Visual social attention in autism spectrum disorder: Insights from eye tracking studies , 2014, Neuroscience & Biobehavioral Reviews.

[46]  Shumeet Baluja,et al.  Non-Intrusive Gaze Tracking Using Artificial Neural Networks , 1993, NIPS.

[47]  Zhiwei Zhu,et al.  Eye gaze tracking under natural head movements , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[48]  Mohan S. Kankanhalli,et al.  Emotional Attention: A Study of Image Sentiment and Visual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[49]  Gang Liu,et al.  A Differential Approach for Gaze Estimation , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[51]  Walter Huber,et al.  Abnormal eye movement behaviour during text reading in neglect syndrome: A case study , 1992, Neuropsychologia.

[52]  Takahiro Okabe,et al.  Adaptive Linear Regression for Appearance-Based Gaze Estimation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.