DeepVOG: Open-source pupil segmentation and gaze estimation in neuroscience using deep learning

BACKGROUND A prerequisite for many eye tracking and video-oculography (VOG) methods is an accurate localization of the pupil. Several existing techniques face challenges in images with artifacts and under naturalistic low-light conditions, e.g. with highly dilated pupils. NEW METHOD For the first time, we propose to use a fully convolutional neural network (FCNN) for segmentation of the whole pupil area, trained on 3946 VOG images hand-annotated at our institute. We integrate the FCNN into DeepVOG, along with an established method for gaze estimation from elliptical pupil contours, which we improve upon by considering our FCNN's segmentation confidence measure. RESULTS The FCNN output simultaneously enables us to perform pupil center localization, elliptical contour estimation and blink detection, all with a single network and with an assigned confidence value, at framerates above 130 Hz on commercial workstations with GPU acceleration. Pupil centre coordinates can be estimated with a median accuracy of around 1.0 pixel, and gaze estimation is accurate to within 0.5 degrees. The FCNN is able to robustly segment the pupil in a wide array of datasets that were not used for training. COMPARISON WITH EXISTING METHODS We validate our method against gold standard eye images that were artificially rendered, as well as hand-annotated VOG data from a gold-standard clinical system (EyeSeeCam) at our institute. CONCLUSIONS Our proposed FCNN-based pupil segmentation framework is accurate, robust and generalizes well to new VOG datasets. We provide our code and pre-trained FCNN model open-source and for free under www.github.com/pydsgz/DeepVOG.

[1]  Zheru Chi,et al.  Video-based biometric identification using eye tracking technique , 2012, 2012 IEEE International Conference on Signal Processing, Communication and Computing (ICSPCC 2012).

[2]  Thiago Santini,et al.  EyeRecToo: Open-source Software for Real-time Pervasive Head-mounted Eye Tracking , 2017, VISIGRAPP.

[3]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[4]  Aaron B. Hoffman,et al.  Thirty-something categorization results explained: selective attention, eyetracking, and models of category learning. , 2005, Journal of experimental psychology. Learning, memory, and cognition.

[5]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[6]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[7]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[8]  Susan K. Schnipke,et al.  Trials and tribulations of using an eye-tracking system , 2000, CHI Extended Abstracts.

[9]  Stefan Kohlbecher,et al.  A novel approach to video-based pupil tracking , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Takeshi Saitoh,et al.  CNN-Based Pupil Center Detection for Wearable Gaze Estimation System , 2017, Appl. Comput. Intell. Soft Comput..

[11]  Igi Ardiyanto,et al.  Robust pupil tracking algorithm based on ellipse fitting , 2016, 2016 International Symposium on Electronics and Smart Devices (ISESD).

[12]  R. Nagel,et al.  Search Dynamics in Consumer Choice under Time Pressure: An Eye-Tracking Study , 2011 .

[13]  Terry K Koo,et al.  A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. , 2016, Journal Chiropractic Medicine.

[14]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[15]  Gerald L. Lohse,et al.  Consumer Eye Movement Patterns on Yellow Pages Advertising , 1997 .

[16]  Amine Ben Slama,et al.  A new preprocessing parameter estimation based on geodesic active contour model for automatic vestibular neuritis diagnosis , 2017, Artif. Intell. Medicine.

[17]  Jafar Saniie,et al.  Eye tracking using monocular camera for gaze estimation applications , 2016, 2016 IEEE International Conference on Electro Information Technology (EIT).

[18]  Wojciech Matusik,et al.  Eye Tracking for Everyone , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Wolfgang Rosenstiel,et al.  ExCuSe: Robust Pupil Detection in Real-World Scenarios , 2015, CAIP.

[20]  Thiago Santini,et al.  ElSe: ellipse selection for robust pupil detection in real-world environments , 2015, ETRA.

[21]  Qiang Ji,et al.  3D gaze estimation with a single camera without IR illumination , 2008, 2008 19th International Conference on Pattern Recognition.

[22]  Dongheng Li,et al.  Starburst: A hybrid algorithm for video-based eye tracking combining feature-based and model-based approaches , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[23]  Neil Dodgson,et al.  A fully-automatic , temporal approach to single camera , glint-free 3 D eye model fitting , 2013 .

[24]  London,et al.  Light, Colour and Vision , 1969 .

[25]  Thomas Brox,et al.  Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[26]  Neil A. Dodgson,et al.  Robust real-time pupil tracking in highly off-axis images , 2012, ETRA.

[27]  Beno Benhabib,et al.  Three-dimensional location estimation of circular features for machine vision , 1992, IEEE Trans. Robotics Autom..

[28]  Hirotake Yamazoe,et al.  Remote gaze estimation with a single camera based on facial-feature tracking without special calibration actions , 2008, ETRA.

[29]  M. Bradley,et al.  The pupil as a measure of emotional arousal and autonomic activation. , 2008, Psychophysiology.

[30]  Yusuke Sugano,et al.  Labelled pupils in the wild: a dataset for studying pupil detection in unconstrained environments , 2015, ETRA.

[31]  Gjergji Kasneci,et al.  PupilNet: Convolutional Neural Networks for Robust Pupil Detection , 2016, ArXiv.

[32]  Moshe Eizenman,et al.  General theory of remote gaze estimation using the pupil center and corneal reflections , 2006, IEEE Transactions on Biomedical Engineering.

[33]  Kang Ryoung Park,et al.  Deep Learning-Based Gaze Detection System for Automobile Drivers Using a NIR Camera Sensor , 2018, Sensors.

[34]  Iain Matthews,et al.  - 100-特集 Passive Driver Gaze Tracking with Active Appearance , 2004 .

[35]  Andreas Bulling,et al.  A novel approach to single camera, glint-free 3D eye model fitting including corneal refraction , 2018, ETRA.

[36]  D Kömpf,et al.  Deficits of smooth pursuit initiation in patients with degenerative cerebellar lesions. , 1999, Brain : a journal of neurology.

[37]  Ajay Kumar,et al.  Comparison and combination of iris matchers for reliable personal authentication , 2010, Pattern Recognit..

[38]  Wen-Bing Horng,et al.  Driver fatigue detection based on eye tracking and dynamk, template matching , 2004, IEEE International Conference on Networking, Sensing and Control, 2004.

[39]  Tomi Kinnunen,et al.  Eye-Movements as a Biometric , 2005, SCIA.

[40]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[41]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[42]  Neil A. Dodgson,et al.  Rendering synthetic ground truth images for eye tracker evaluation , 2014, ETRA.