Improving real-time CNN-based pupil detection through domain-specific data augmentation

Deep learning is a promising technique for real-world pupil detection. However, the small amount of available accurately-annotated data poses a challenge when training such networks. Here, we utilize non-challenging eye videos where algorithmic approaches perform virtually without errors to automatically generate a foundational data set containing subpixel pupil annotations. Then, we propose multiple domain-specific data augmentation methods to create unique training sets containing controlled distributions of pupil-detection challenges. The feasibility, convenience, and advantage of this approach is demonstrated by training a CNN with these datasets. The resulting network outperformed current methods in multiple publicly-available, realistic, and challenging datasets, despite being trained solely with the augmented eye images. This network also exhibited better generalization w.r.t. the latest state-of-the-art CNN: Whereas on datasets similar to training data, the nets displayed similar performance, on datasets unseen to both networks, ours outperformed the state-of-the-art by ≈27% in terms of detection rate.

[1]  Wolfgang Rosenstiel,et al.  The art of pervasive eye tracking: unconstrained eye tracking in the Austrian Gallery Belvedere , 2018, PETMEI@ETRA.

[2]  Thiago Santini,et al.  PuRe: Robust pupil detection for real-time pervasive eye tracking , 2017, Comput. Vis. Image Underst..

[3]  Thiago Santini,et al.  Fast and Robust Eyelid Outline and Aperture Detection in Real-World Scenarios , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[4]  Francisco Javier Vera-Olmos,et al.  DeepEye: Deep convolutional network for pupil detection in real environments , 2018, Integr. Comput. Aided Eng..

[5]  Wolfgang Rosenstiel,et al.  CBF: circular binary features for robust and real-time pupil center detection , 2018, ETRA.

[6]  Takeshi Saitoh,et al.  Effectiveness of Data Augmentation for CNN-Based Pupil Center Point Detection , 2018, 2018 57th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE).

[7]  Thomas C. Kübler,et al.  Driving with Glaucoma: Task Performance and Gaze Movements , 2015, Optometry and vision science : official publication of the American Academy of Optometry.

[8]  Wolfgang Rosenstiel,et al.  PupilNet v2.0: Convolutional Neural Networks for CPU based real time Robust Pupil Detection , 2017, ArXiv.

[9]  Siddhartha S. Srinivasa,et al.  Eye-Hand Behavior in Human-Robot Shared Manipulation , 2018, 2018 13th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[10]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Francisco Javier Vera-Olmos,et al.  Deconvolutional Neural Network for Pupil Detection in Real-World Environments , 2017, IWINAC.

[12]  Wolfgang Rosenstiel,et al.  BORE: boosted-oriented edge optimization for robust, real time remote pupil center detection , 2018, ETRA.

[13]  Andrea F. Abate,et al.  BIRD: Watershed Based IRis Detection for mobile devices , 2015, Pattern Recognit. Lett..

[14]  Thiago Santini,et al.  Bayesian identification of fixations, saccades, and smooth pursuits , 2015, ETRA.

[15]  Yusuke Sugano,et al.  Self-Calibrating Head-Mounted Eye Trackers Using Egocentric Visual Saliency , 2015, UIST.

[16]  Enkelejda Kasneci,et al.  Pupil detection for head-mounted eye tracking in the wild: an evaluation of the state of the art , 2016, Machine Vision and Applications.

[17]  Peter Robinson,et al.  Rendering of Eyes for Eye-Shape Registration and Gaze Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Joanne M Wood,et al.  Night‐time pedestrian conspicuity: effects of clothing on drivers’ eye movements , 2017, Ophthalmic & physiological optics : the journal of the British College of Ophthalmic Opticians.

[19]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Guang-Zhong Yang,et al.  Differences in gaze behaviour of expert and junior surgeons performing open inguinal hernia repair , 2015, Surgical Endoscopy.

[21]  Takeshi Saitoh,et al.  CNN-Based Pupil Center Detection for Wearable Gaze Estimation System , 2017, Appl. Comput. Intell. Soft Comput..

[22]  Hongjian Shi,et al.  Head Mounted Pupil Tracking Using Convolutional Neural Network , 2018, ArXiv.

[23]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[24]  Neil A. Dodgson,et al.  Robust real-time pupil tracking in highly off-axis images , 2012, ETRA.

[25]  Thiago Santini,et al.  PuReST: robust pupil tracking for real-time pervasive eye tracking , 2018, ETRA.

[26]  W. Rosenstiel,et al.  Driving with Binocular Visual Field Loss? A Study on a Supervised On-Road Parcours with Simultaneous Eye and Head Tracking , 2014, PloS one.

[27]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[28]  Yusuke Sugano,et al.  Labelled pupils in the wild: a dataset for studying pupil detection in unconstrained environments , 2015, ETRA.

[29]  Gjergji Kasneci,et al.  PupilNet: Convolutional Neural Networks for Robust Pupil Detection , 2016, ArXiv.

[30]  Jürgen Schmidt,et al.  Eye blink detection for different driver states in conditionally automated driving and manual driving using EOG and a driver camera , 2018, Behavior research methods.

[31]  W. Rosenstiel,et al.  Binocular Glaucomatous Visual Field Loss and Its Impact on Visual Exploration - A Supermarket Study , 2014, PloS one.

[32]  Wolfgang Rosenstiel,et al.  ExCuSe: Robust Pupil Detection in Real-World Scenarios , 2015, CAIP.

[33]  Thiago Santini,et al.  ElSe: ellipse selection for robust pupil detection in real-world environments , 2015, ETRA.

[34]  W. Rosenstiel,et al.  Homonymous Visual Field Loss and Its Impact on Visual Exploration: A Supermarket Study. , 2014, Translational vision science & technology.