PupilNet: Convolutional Neural Networks for Robust Pupil Detection

Real-time, accurate, and robust pupil detection is an essential prerequisite for pervasive video-based eye-tracking. However, automated pupil detection in real-world scenarios has proven to be an intricate challenge due to fast illumination changes, pupil occlusion, non centered and off-axis eye recording, and physiological eye characteristics. In this paper, we propose and evaluate a method based on a novel dual convolutional neural network pipeline. In its first stage the pipeline performs coarse pupil position identification using a convolutional neural network and subregions from a downscaled input image to decrease computational costs. Using subregions derived from a small window around the initial pupil position estimate, the second pipeline stage employs another convolutional neural network to refine this position, resulting in an increased pupil detection rate up to 25% in comparison with the best performing state-of-the-art algorithm. Annotated data sets can be made available upon request.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  Dominik Schmidt,et al.  Eye Pull, Eye Push: Moving Objects between Large Screens and Personal Devices with Gaze and Touch , 2013, INTERACT.

[3]  Linden J. Ball,et al.  An Eye Movement Analysis of Web Page Usability , 2002 .

[4]  W. Rosenstiel,et al.  Driving with Binocular Visual Field Loss? A Study on a Supervised On-Road Parcours with Simultaneous Eye and Head Tracking , 2014, PloS one.

[5]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[6]  Manfred Tscheligi,et al.  Eye Tracking in the Car: Challenges in a Dual-Task Scenario on a Test Track , 2014, AutomotiveUI.

[7]  Zahra Hakimi,et al.  SET: a pupil detection method using sinusoidal approximation , 2015, Front. Neuroeng..

[8]  Dongheng Li,et al.  Starburst: A hybrid algorithm for video-based eye tracking combining feature-based and model-based approaches , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[9]  Wolfgang Rosenstiel,et al.  ExCuSe: Robust Pupil Detection in Real-World Scenarios , 2015, CAIP.

[10]  Enkelejda Kasneci,et al.  Towards the automated recognition of assistance need for drivers with impaired visual field , 2013 .

[11]  Andreas Bulling,et al.  Pupil: an open source platform for pervasive eye tracking and mobile gaze-based interaction , 2014, UbiComp Adjunct.

[12]  W. Rosenstiel,et al.  Homonymous Visual Field Loss and Its Impact on Visual Exploration: A Supermarket Study. , 2014, Translational vision science & technology.

[13]  David Dussault,et al.  Noise performance comparison of ICCD with CCD and EMCCD cameras , 2004, SPIE Optics + Photonics.

[14]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Neil A. Dodgson,et al.  Robust real-time pupil tracking in highly off-axis images , 2012, ETRA.

[16]  Ingemar J. Cox,et al.  An Analysis of Camera Noise , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Lun Yu,et al.  A robust and accurate detection of pupil images , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[18]  B. Tatler,et al.  The Moving Tablet Of The Eye , 2005 .

[19]  O.K. Tonguz,et al.  A High Speed Eye Tracking System with Robust Pupil Center Estimation Algorithm , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[20]  John Paulin Hansen,et al.  Evaluation of a low-cost open-source gaze tracker , 2010, ETRA.

[21]  Fengliang Xu,et al.  Real-time eye detection and tracking for driver observation under various light conditions , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[22]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[23]  Susan K. Schnipke,et al.  Trials and tribulations of using an eye-tracking system , 2000, CHI Extended Abstracts.

[24]  Antonio García Dopico,et al.  A Precise Eye-Gaze Detection and Tracking System , 2003, WSCG.

[25]  Rasmus Berg Palm,et al.  Prediction as a candidate for learning deep hierarchical models of data , 2012 .

[26]  K. Jellinger The Moving Tablet of the Eye: The Origins of Modern Eye Movement Research , 2006 .

[27]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[28]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[29]  Y. Reibel,et al.  CCD or CMOS camera noise characterisation , 2003 .

[30]  Hilbert J. Kappen,et al.  On-line learning processes in artificial neural networks , 1993 .

[31]  M. Magnor,et al.  Real-Time Gaze Tracking with a Consumer-Grade Video Camera , 2010 .