GEDDnet: A Network for Gaze Estimation with Dilation and Decomposition

Appearance-based gaze estimation from RGB images provides relatively unconstrained gaze tracking from commonly available hardware. The accuracy of subject-independent models is limited partly by small intra-subject and large inter-subject variations in appearance, and partly by a latent subject-dependent bias. To improve estimation accuracy, we propose to use dilated-convolutions in a deep convolutional neural network to capture subtle changes in the eye images, and a novel gaze decomposition method that decomposes the gaze angle into the sum of a subject-independent gaze estimate from the image and a subject-dependent bias. To further reduce estimation error, we propose a calibration method that estimates the bias from a few images taken as the subject gazes at only a few or even just a single gaze target. This significantly redues calibration time and complexity. Experiments on four datasets, including a new dataset we collected containing large variations in head pose and face location, indicate that even without calibration the estimator already outperforms state-of-the-art methods by more than 6.3%. The proposed calibration method is robust to the location of calibration target and reduces estimation error significantly (up to 35.6%), achieving state-of-the-art performance with much less calibration data than required by previously proposed methods.

[1]  Gang Liu,et al.  Deep Multitask Gaze Estimation with a Constrained Landmark-Gaze Model , 2018, ECCV Workshops.

[2]  Yusuke Sugano,et al.  Revisiting data normalization for appearance-based gaze estimation , 2018, ETRA.

[3]  Joohwan Kim,et al.  Towards foveated rendering for gaze-tracked virtual reality , 2016, ACM Trans. Graph..

[4]  David A. Atchison,et al.  Optics of the Human Eye , 2023 .

[5]  Gang Liu,et al.  A Differential Approach for Gaze Estimation with Calibration , 2018, BMVC.

[6]  James M. Rehg,et al.  Connecting Gaze, Scene, and Attention: Generalized Attention Estimation via Joint Modeling of Gaze and Scene Saliency , 2018, ECCV.

[7]  Otmar Hilliges,et al.  Deep Pictorial Gaze Estimation , 2018, ECCV.

[8]  Gang Liu,et al.  A Differential Approach for Gaze Estimation , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Bertram E. Shi,et al.  Task-embedded online eye-tracker calibration for improving robustness to head motion , 2019, ETRA.

[10]  Erik Lindén,et al.  Learning to Personalize in Appearance-Based Gaze Tracking , 2018, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[11]  Wangjiang Zhu,et al.  Monocular Free-Head 3D Gaze Tracking with Deep Learning and Geometry Constraints , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[12]  Shane L. Rogers,et al.  Using dual eye tracking to uncover personal gaze patterns during social interaction , 2018, Scientific Reports.

[13]  Jean-Marc Odobez,et al.  EYEDIAP: a database for the development and evaluation of gaze estimation algorithms from RGB and RGB-D cameras , 2014, ETRA.

[14]  Jan Kautz,et al.  Light-Weight Head Pose Invariant Gaze Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[15]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[16]  Bertram E. Shi,et al.  Probabilistic adjustment of dwell time for eye typing , 2017, 2017 10th International Conference on Human System Interactions (HSI).

[17]  Yoichi Sato,et al.  Learning-by-Synthesis for Appearance-Based 3D Gaze Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Sergey Ioffe,et al.  Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models , 2017, NIPS.

[19]  Timo Schneider,et al.  Manifold Alignment for Person Independent Appearance-Based Gaze Estimation , 2014, 2014 22nd International Conference on Pattern Recognition.

[20]  Bertram E. Shi,et al.  Appearance-Based Gaze Estimation Using Dilated-Convolutions , 2018, ACCV.

[21]  Jan Cech,et al.  Real-Time Eye Blink Detection using Facial Landmarks , 2016 .

[22]  Sergio Escalera,et al.  Recurrent CNN for 3D Gaze Estimation using Appearance and Shape Cues , 2018, BMVC.

[23]  Ernesto Brau,et al.  Multiple-Gaze Geometry: Inferring Novel 3D Locations from Gazes Observed in Monocular Video , 2018, ECCV.

[24]  Hyunwoo Kim,et al.  Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Mario Fritz,et al.  Appearance-based gaze estimation in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Yusuke Sugano,et al.  Evaluation of Appearance-Based Methods and Implications for Gaze-Based Applications , 2019, CHI.

[27]  Kai Kunze,et al.  Anyorbit: orbital navigation in virtual environments with eye-tracking , 2018, ETRA.

[28]  Bertram E. Shi,et al.  Offset Calibration for Appearance-Based Gaze Estimation via Gaze Decomposition , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[29]  Frans W. Cornelissen,et al.  Towards using the spatio-temporal properties of eye movements to classify visual field defects , 2018, ETRA.

[30]  Steven K. Feiner,et al.  Gaze locking: passive eye contact detection for human-object interaction , 2013, UIST.

[31]  Yiannis Demiris,et al.  RT-GENE: Real-Time Eye Gaze Estimation in Natural Environments , 2018, ECCV.

[32]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[33]  Subramanian Ramanathan,et al.  Eye Contact Detection via Deep Neural Networks , 2017, HCI.

[34]  Shenghua Gao,et al.  Multiview Multitask Gaze Estimation With Deep Convolutional Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[36]  Zhe He,et al.  Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[37]  Louis-Philippe Morency,et al.  OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[38]  Jan Kautz,et al.  Few-Shot Adaptive Gaze Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[39]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[40]  Feng Lu,et al.  Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric Regression , 2018, ECCV.

[41]  Peter Robinson,et al.  Learning an appearance-based gaze estimator from one million synthesised images , 2016, ETRA.

[42]  Moshe Eizenman,et al.  General theory of remote gaze estimation using the pupil center and corneal reflections , 2006, IEEE Transactions on Biomedical Engineering.

[43]  Tomas Pfister,et al.  Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Gang Liu,et al.  Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Wojciech Matusik,et al.  Eye Tracking for Everyone , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Mario Fritz,et al.  MPIIGaze: Real-World Dataset and Deep Appearance-Based Gaze Estimation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Chandan Kumar,et al.  GazeTheWeb: A Gaze-Controlled Web Browser , 2017, W4A.

[48]  Mario Fritz,et al.  It’s Written All Over Your Face: Full-Face Appearance-Based Gaze Estimation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[49]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Wolfgang Rosenstiel,et al.  CBF: circular binary features for robust and real-time pupil center detection , 2018, ETRA.

[51]  Shaojie Shen,et al.  SLAM-based localization of 3D gaze using a mobile eye tracker , 2018, ETRA.

[52]  Qiang Ji,et al.  A Hierarchical Generative Model for Eye Image Synthesis and Eye Gaze Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53]  Bilge Mutlu,et al.  Anticipatory robot control for efficient human-robot collaboration , 2016, 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[54]  T. Loetscher,et al.  Eye Movements During Everyday Behavior Predict Personality Traits , 2018, Front. Hum. Neurosci..